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VECTORS FOR OCULAR TRANSDUCTION AND USE THEREOF FOR GENETIC 

THERAPY 

Work described herein was supported by NIH grants EY1 1431 and 
HL54352. The government has certain rights in such subject matter. 
5 RELATED APPLICATIONS 

This application claims the benefit of priority to U.S. application Serial No. 
09/562,934, filed May 1, 2000, to Glen R. Nemerow, Daniel Von Seggern,; 
Martin Friedlander, entitled "VECTORS FOR OCULAR TRANSDUCTION AND USE 
THEREFOR FOR GENETIC THERAPY". 
10 This application is related to copending U.S. application Serial 

No. 09/482,682 (also filed as International PCT application No. 
PCT/USOO/00265, filed January 14, 2000)), to Daniel Von Seggern, -Glen R. 
Nemerow, Paul Hallenbeck, Susan Stevenson, Yelena Skripchenko, filed January 
1.4, 2000, entitled "Adenovirus Vectors, Packaging Cell Lines,-Compositions, and 
15 Methods for Preparation and Use," which is a continuation-in-part of U.S. 

Application 09/423,783 filed November 12, 1999 and claims the benefit of the 
filing date of U.S. Provisional Application 60/1 1 5,920 filed January 14, 1999. 
Where permitted, the contents and subject matter of each application and of the 
provisional application are incorporated in their entirety herein by reference. 
20 FIELD OF INVENTION 

The present invention relates to gene therapy, especially to adenovirus 
vector-based gene therapy. In particular, adenovirus vectors for therapy of 
ocular diseases and methods of treatment using the vectors are provided. 
Compositions, kits, and methods of preparation and use of the vectors for gene 
25 therapy are provided. 

BACKGROUND OF THE INVENTION 
Retinal dystrophies 

The eye is susceptible to a number of hereditary and/or age related 
degenerative disorders. In the United States, common causes of irreversible 
30 blindness or severe loss of vision are retinal dystrophies (see, e.g., Cotlier eta/. 
(1995) Surv. Ophthalmology 40:51-61; Bird (1995) Am. J. Ophthal. Jt9: 
543-562; and Adler (1 996) Arch Ophthal 1 74:79-83). The retina is the sensory 
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tunic of the eye, containing light sensitive receptors, a complex of neurons, and 
pigmented epithelium, arranged in discrete layers. In humans, the macula is the 
portion of the retina that lies directly behind the lens. Cones, the photoreceptor 
cells responsible for central vision, are heavily concentrated in the macula. 
5 Central dystrophies, which affect the macula, include Best's disease, age-related 
macular degeneration, and Stargardt's macular dystrophy. The peripheral retina 
is composed mainly of rods, which are responsible for side and night vision. 
Peripheral degenerative retinal diseases include retinitis pigmentosa, choroidemia 
and Bietti's crystalline dystrophy. 
10 Macular degenerations are a heterogenous group of diseases, 

characterized by progressive central vision loss and degeneration of the macula 
and underlying retinal pigmented epithelium. Age-related macular degeneration 
(ARMD) is the most common form of the disease, affecting an estimated 20% of 
persons over 75 years of age. ARMD is poorly understood in terms of etiology 

15 and pathogenesis. The very late onset of the disease has made genetic mapping 
particularly difficult. Certain macular degenerative conditions with a clear genetic 
basis, such as Stargardt's and Best's diseases, share many features with ARMD, 
but have been more amenable to molecular and genetic analysis. 

Hereditary peripheral retinopathies are also relatively common. Retinitis 

20 pigmentosa (RP), for example, affects approximately 1.5 million people 
worldwide. Substantial genetic heterogeneity has been observed in this 
condition, with over 20 chromosomal loci identified. A predisposition to retinitis 
pigmentosa can be inherited by autosomal dominant, autosomal recessive, 
X-linked or digenic mode. Mutations have been identified in seven genes, four of 

25 which encode proteins in the rod phototransduction cascade: rhodopsin, alpha 
and beta subunits of rod cGMP phosphodiesterase, and rod cGMP-cation-gated 
channel protein .alpha, subunit. Mutations in the peripherin/RDS gene have been 
linked to retinitis pigmentosa and macular degeneration. A single peripherin/RDS 
mutation apparently caused retinitis pigmentosa, pattern dystrophy and fundus 

30 flavimaculatus, in different family members. 
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In spite of causal heterogeneity, there is significant -clinical similarity 
among RP subtypes. Common signs and symptoms include early 
electroretinographic abnormalities, ophthalmoscopic findings, and 
protracted, contiguous expansion of the ring-like scotoma toward the macula, 
5 leading to progressively worsening tunnel vision. A recent hypothesis is that 
active photoreceptor cell death, which is characteristic of these genetically 
distinct disorders, is mediated by a common induction of apoptosis. It may be 
possible to treat these conditions by the administration of agents that block 
induction of apoptosis in photoreceptors, such as neurotrophic factors. 

10 Adenovirus delivery vectors 

Adenovirus, which is a DNA virus with a 36 kilobase (kb) genome, is very 
well-characterized and its genetics and genetic organization are understood. 
The genetic organization of adenoviruses permits substitution of large fragments 
of viral DNA with foreign DNA. In addition, recombinant adenoviruses are 

15 structurally stable and no rearranged viruses are observed after extensive 
amplification. 

Adenoviruses have been employed as delivery vehicles for introducing 
desired genes into eukaryotic cells. The adenovirus delivers such genes to 
eukaryotic cells by binding to cellular receptors followed by internalization. The 

20 adenovirus fiber protein is responsible for binding to cells. The fiber protein has 
two domains, a rod-like shaft portion and a globular head portion that contains 
the receptor binding region. The fiber spike is a homotrimer, and there are 1 2 
spikes per virion. Human adenoviruses bind to and infect a broad range of 
cultured cell lines and primary tissues from different species. 

25 The 35,000+ base pair (bp) genome of adenovirus type 2 has been 

sequenced and the predicted amino acid sequences of the major coat proteins 
{hexon, fiber and penton base) have been described {see, e.g., Neumann eta/.. 
Gene 69: 153-157 (1988); Herisse eta/., Nuc. Acids Res. 9: 4023-4041 (1981); 
Roberts eta/., J. Biol. Chem. 259: 13968-13975 (1984); Kinloch *et a/., J.Biol. 

30 Chem. 259: 6431-6436 (1984); and Chroboczek et af., Virol. 7£/:549-S54, 
1987). 
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The 35,935 bp sequence of Ad5 DNA is also known and portions of 
many other adenovirus genomes have been sequenced. The upper packaging 
limit for adenovirus virions is about 105% of the wild-type genome length (see, 
e.g., Bett, et at., J. Virol. 67(10): 591 1-21 ,1 993). Thus,, for Ad2 and Ad5. this 
5 would be an upper packaging limit of about 38kb of DNA. 

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs) 
ranging in size from about 100 to 150 bp, depending on the serotype. The 
inverted repeats permit single strands of viral DNA to circularize by base-pairing 
of their terminal sequences to form base-paired "panhandle" structures that are 

10 required for replication of the viral DNA. 

For efficient packaging, the ITRs and the packaging signal (a few hundred bp in 
length) comprise the "minimum requirement" for replication and packaging of a 
genomic nucleic acid into an adenovirus particle. Helper-dependent vectors 
lacking all viral ORFs but including these essential cis elements (the ITRs and 

15 contiguous packaging sequence) have been constructed. 

Ad vectors have several distinct advantages as gene delivery vehicles. 
For example, recombination of such vectors is rare; there are no known 
associations of human malignancies with adenoviral infections despite common 
human infection with adenoviruses; the genome may be manipulated to 

20 accommodate foreign genes of a fairly substantial size; and host proliferation is 
not required for expression of adenoviral proteins. Adenovirus (Ad)-based gene 
delivery vectors efficiently infect many different cells and tissues. This broad 
tropism, however, means that gene delivery cannot be directed to a specific 
target cell. A large fraction of intravenously administered adenovirus is 

25 retained by the liver, which could lead to undesirable side-effects. Adenovirus 
may potentiate immune responses. For example, Adenovirus type 5 (Ad5) also 
transduces dendritic cells, which present antigens very efficiently, thereby 
possibly exacerbating the immune response against the vector. It has been 
proposed that vectors with different targeting efficiencies might eliminate these 

30 problems, permitting a lower total particle dose and more specific targeting (see, 
e.g., U.S. application Serial No. 09/482,682). 
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The wealth of information on adenovirus structure and mechanism of 
infection, its efficient infection of nondividing cells, and its large genetic capacity 
make adenovirus a popular gene therapy vector. The wide expression of 
receptors to which adenovirus binds makes targeting adenovirus vectors 
5 difficult. 

Hence there is a need to improve delivery and targeting of adenoviral 
vectors and also to provide treatments for ocular disorders. Therefore, it is an 
object herein to provide adenoviral vectors that specifically or selectively target 
cells in the eye. It is also an object herein to provide these vectors for treatment £ 

10 of ocular disorders. 

SUMMARY OF THE INVENTION 

Degenerative ocular diseases, such as, but not limited to, retinitis 
pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, 
and others (see, e.g., Table below), have a genetic basis. Genes expressed in 

15 the photoreceptor cells at the back of the retina are implicated in these diseases. 
Provided herein are recombinant viral vectors for targeting therapeutic products 
to these cells. 

Recombinant adenoviral vectors that include nucleic acid that permits 
specific binding to these photoreceptors are provided. In particular, the vector 

20 particles contain a fiber protein of Ad37 or a modified form thereof. As shown ^ 
herein, fiber protein from Ad37 permits efficient infection of photoreceptor cells. 
Fiber proteins from other adenovirus D serotypes may also be used. In addition, 
the portions of the fiber protein, particularly those that interact with other viral 
structural proteins, such as penton, may be modified to resemble the viral source 

25 of the other structural proteins. As exemplified herein, the recombinant virus 
provided herein include Ad5 structural components. The N-terminus of the Ad37 
fiber protein, which interacts with the penton protein, is modified to resemble 
the Ad5 fiber protein N-terminus to ensure production of viral particles. 

The recombinant adenoviral vectors are intended for gene therapy of 

30 diseases in which genes expressed in the photoreceptors are implicated. Such 
diseases include, but are not limited to, degenerative ocular diseases, such as 
retinitis pigmentosa and Stargardt's disease. These vectors are also useful for 
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targeting to other ocular cells, such as conjunctival cells, which also bear 
receptors to which fiber from Ad37 and related serotypes bind. 

The vectors will deliver therapeutic agents to the targeted cells for 
treatment of a variety of disorders (see e.g.,, Tables 3 and 4, below)). The 
5 therapeutic agents are intended for expression in the photoreceptors and for 
secretion from the photorecptor cells, which are surrounded on one side by 
choroidal vasculature, and on the other side by retinal vasulature, thereby 
providing a means for delivery of products. In addition, expression of growth 
factors, such as VEGF and others, can be used to enhance blood flow to the 

10 retina and prevent or slow the degeneration. 

Therapeutic agents encoded by the recombinant adenoviral vectors 
include, but are not limited to, nucleic acid nucleic acid molecules encoding 
genes that are defective in certain hereditary disorders, nucleic acid molecules 
that encode antiangiogenics and antitumor agents for treatment of retinal 

15 disorders, such as retinoblastomas; nucleic acid molecules encoding trophic 
factors, such as glial cell line-derived neuroptrophic factor (GDNF) and ciliary 
neurotrophic factor (CNTF), growth factors and growth factor inhibitors, 
antiapoptotic factors, such as Bcl-2 (CNTF), antitumor agents, anti-angiogenics, 
and genes or portions thereof for gene replacement or repair of defective genes. 

20 Hence, methods for treatment of inherited and acquired retinal diseases, 

including diseases involving neovascular and vascular degeneration are provided. 

Methods for treating diseases involving genes expressed in photoreceptor 
cells are provided herein. The methods provided herein are practiced by 

25 administration of the recombinant viral vectors by any means suitable for 

delivery to the photoreceptors. A preferred mode of administration is intraocular 
injection including intravitreal and subretinal injection. Other modes of 
administration include, but are not limited to, intrascleral, periorbital and 
intravenous administration. The vectors also can include photoreceptor-specific 

30 promoters thereby providing a means, not only for specific targeting of 
expression in these cells, but also for photoreceptor-restricted transgene 
expression. 
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DET AILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
5 which this invention belongs. All patents, applications, published applications 
and other publications and sequences from GenBank and other data bases 
referred to anywhere in the disclosure herein are incorporated by reference in 
their entirety. 

As used herein, the amino acids, which occur in the various amino acid 
10 sequences appearing herein, are identified according to their three-letter or one- 
letter abbreviations. The nucleotides, which occur in the various DNA 
fragments, are designated with the standard single-letter designations used 
routinely in the art (see, Table 1). 

As used herein, amino acid residue refers to an amino acid formed upon 
15 chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The 
* amino acid residues described herein are preferably in the isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is retained by the 
polypeptide. NH 2 refers to the free amino group present at the amino terminus 
20 of a polypeptide. COOH refers to the free carboxy group present at the carboxyl 
terminus of a polypeptide. In keeping with standard polypeptide nomenclature 
described in J. Biol. Chem., 243:3552-59 (1969) and adopted at 37C.F.R. § § 
1.821 - 1.822, abbreviations for amino acid residues are shown in the following 
Table: 

25 Table 1 



Table of Correspondence 



SYMBOL 




1 -Letter 


3-Letter 


AMINO ACID 


Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 
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5 



10 



15 



SYMBOL 




A 


Ala 


alanine 


S 


Ser 


serine 


1 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


Z 


Glx 


Glu and/or Gin 


w 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 1 


Asn 


asparagine 


B 


Asx 


Asn and/or Asp 


C 


Cys 


cysteine 


X 


Xaa 


Unknown or other 



20 

It should be noted that all amino acid residue sequences represented . 
herein by formulae have a left to right orientation in the conventional direction of 
amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid 
residue" is broadly defined to include the amino acids listed in the Table of 
25 Correspondence and modified and unusual amino acids, such as those referred 
to in 37 C.F.R. § § 1.821-1.822, and incorporated herein by reference. 
Furthermore, it should be noted that a dash at the beginning or end of an amino 
acid residue sequence indicates a peptide bond to a further sequence of one or 



BNSDOCID: <WO_0t83729A2J_> 




PCT/EP01/04863 



-9- 

more amino acid residues or to an amino-terminal group such as NH 2 or to a 
carboxyl-terminal group such as COOH. 

In a peptide or protein, suitable conservative substitutions of amino acids 
are known to those of skill in this art and may be made generally without, altering 
5 the biological activity of the resulting molecule, those of skill in this art 
recognize that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter biological activity (see, e.g. , 
Watson et aK Molecular Biology of the Gene, 4th Edition, 1 987, The 
Bejacmin/Cummings Pub. co., p.224). 
10 Such substitutions are preferably made in accordance with those set forth 

in TABLE 2 as follows: 

TABLE 2 





Original residue 


Conservative substitution 




Ala (A) 


Gly; Ser 


15 


Arg (R) 


Lys 




Asn (N) 


Gin; His 




Cys (C) 


Ser 




Gin (Q) 


Asn 




Glu (E) 


Asp 


20 


Gly (G) 


Ala; Pro 




His (H) 


Asn; Gin 




He (I) 


Leu; Val 




Leu (L) 


lie; Val 




Lys (K) 


Arg; Gln;<3lu 


25 


Met (M) 


Leu; Tyr; He 




Phe (F) 


Met; Leu; Tyr 




Ser (S) 


Thr 




Thr(T) 


Ser 




Trp (W) 


Tyr 


30 


Tyr(Y) 


Trp; Phe 




Val (V) 


He; Leu 



Other substitutions are also permissible and may be determined empirically or in 
accord with known conservative substitutions. 
35 As used herein, a complementing plasmid describes plasmid vectors that 

deliver nucleic acids into a packaging cell line for stable integration into a 
chromosome in the cellular genome. 

As used herein, a delivery plasmid is a plasmid vector that carries or 
delivers nucleic acids encoding a therapeutic gene or gene that encodes a 
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therapeutic product or a precursor thereof or a regulatory gene or other factor 
that results in a therapeutic effect when delivered in vivo in or into a cell line, 
such as, but not limited to a packaging cell line, to propagate therapeutic viral 
vectors. 

5 As used herein, a variety of vectors with different requirements are 

described. For example, one vector is used to deliver particular nucleic acid 
molecules into a packaging cell line for stable integration into a chromosome. 
These types of vectors are generally identified herein as complementing 
plasmids. A further type of vector described herein carries or delivers nucleic 

10 acid molecules in or into a cell line (e.g., a packaging cell line) for the purpose of 
propagating therapeutic viral vectors; hence, these vectors are generally referred 
to herein as delivery plasmids. A third "type" of vector described herein is used 
to carry nucleic acid molecules encoding therapeutic proteins or polypeptides or 
regulatory proteins or are regulatory sequences to specific cells or cell types in a 

1 5 subject in need of treatment; these vectors are generally identified herein as 
therapeutic viral vectors or recombinant adenoviral vectors or viral Ad-derived . 
vectors and are in the form of a virus particle encapsulating a viral nucleic acid 
containing an expression cassette for expressing the therapeutic gene. 

As used herein, a DNA or nucleic acid homolog refers to a nucleic acid 

20 that includes a preselected conserved nucleotide sequence, such as a sequence 
encoding a therapeutic polypeptide. By the term "substantially homologous" is 
meant having at least 80%, preferably at least 90%, most preferably at least 
95% homology therewith or a lesser percentage of homology or identity and 
conserved biological activity or function. 

25 The terms "homology" and "identity" are often used interchangeably. In 

this regard, percent homology or identity may be determined, for example, by 
comparing sequence information using a GAP computer program. The GAP 
program utilizes the alignment method of Needleman and Wunsch iJ. Mo/. Biol. 
48:443 (1970), as revised by Smith and Waterman AAdv. Appl. Math. 2:482 

30 {1981). Briefly, the GAP program defines similarity as the number of aligned 

symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
number of symbols in the shorter of the two sequences. The preferred default 
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parameters for the GAP program may include: (1) a unary comparison matrix 
(containing a value of 1 for identities and 0 for non-identities) and the weighted 
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986), 
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE 
5 AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358 

(1979); (2) a penalty of 3.0 for each gap and an additional 0.1O penalty for each 
symbol in each gap; and (3) no penalty for 
end gaps. 

Whether any two nucleic acid molecules have nucleotide sequences that 
10 are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
determined using known computer algorithms such as the "FAST A" program, 
using for example, the default parameters as in Pearson and Lipman, Proc. Natl. 
Acad. ScL USA 55:2444 (1988). Alternatively the BLAST function of the 
National Center for Biotechnology Information database maybe used to 
15 determine identity. 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be -calculated 
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 
20 and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and<3riffin, H:<3., 
eds., Humana Press, New Jersey, 1 994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M^Stockton Press, New York, 1991). 
25 While there exist a number of methods to measure identity between two 

polynucleotides or polypeptide sequences, the term "identity" is well known to 
skilled artisans (Carillo, H. & Upton, D., SIAM J Applied Math 45:1073 (1988)). 
Methods commonly employed to determine identity or similarity between two 
sequences include, but are not limited to, those disclosed in Guide to Huge 
30 Computers, Martin J. Bishop, ed.. Academic Press, San Diego, 1 994, and 
Carillo, H. & Lipton, D., SIAM J Applied Math 45:1073 (1988). Methods to 
determine identity and similarity are codified in computer programs. Preferred 
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computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., 
etaL, Nucleic Acids Research 12(0:387 (1984)), BLASTP, BLASTN, FASTA 
(Atschul, S.F., et al., J Molec Biol 2/5:403 (1 990)). 
5 Therefore, as used herein, the term "identity" represents a comparison 

between a test and a reference polypeptide or polynucleotide. For example, a 
test polypeptide may be defined as any polypeptide that is 90% or more 
identical to a reference polypeptide. As used herein, the term at least "90% 
identical to" refers to percent identities from 90 to 99.99 relative to the 

10 reference polypeptides. Identity at a level of 90% or more, is indicative of the 
fact that, assuming for exemplification purposes a test and reference 
polynucleotide length of 100 amino acids are compared. No more than 10% 
(i.e., 10 out of 100) amino acids in the test polypeptide differs from that of the 
reference polypeptides. Similar comparisons may be made between a test and 

15 reference polynucleotides. Such differences may be represented as point 

mutations randomly distributed over the entire length of an amino acid sequence 
or they may be clustered in one or more locations of varying length up to the 
maximum allowable, e.g. 10/100 amino acid difference (approximately 90% 
identity). Differences are defined as nucleic acid or amino acid substitutions, or 

20 deletions. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 
into the selected target cells in a manner such that the heterologous DNA is 

25 expressed and a therapeutic product encoded thereby is produced. 

Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 
of a therapeutic product. Genetic therapy may also be used to deliver nucleic 

30 acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
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factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 
receptor therefor, that is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product mayi>e modified 
5 prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, heterologous DNA is DNA that encodes RNA and proteins 
that are not normally produced in vivo by the cell in which it is expressed or that 
mediates or encodes mediators that alter expression of endogenous DNA by 

10 affecting transcription, translation, or other regulatable biochemical processes. 
Heterologous DNA may also be referred to as foreign DNA. Any DNA that one 
of skill in the art would recognize or consider as heterologous or foreign to the 
cell in which it is expressed is herein encompassed by heterologous DNA. 
Examples of heterologous DNA include, but are not limited to, DNA that encodes 

1 5 traceable marker proteins, such as a protein that confers drug resistance, DNA 
that encodes therapeutically effective substances, such as anti-cancer agents, 
enzymes and hormones, and DNA that encodes other types of proteins, such as 
antibodies. Antibodies that are encoded by heterologous DNA may be secreted 
or expressed on the surface of the cell in which the heterologous t)NA has been 

20 introduced. 

Hence, herein heterologous DNA or foreign DNA, refers to a DNA 
molecule not present in the exact orientation and position as the counterpart 
DNA molecule found in the corresponding wild-type adenovirus. It may also 
refer to a DNA molecule from another organism or species {i.e., exogenous) or 

25 from another Ad serotype. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 

30 Typically, DNA encoding the desired heterologous DNA is cloned into a 

plasmid vector and introduced by routine methods, such as calcium-phosphate . 
mediated DNA uptake (see, (1981) Somat. Cell. Mol. Genet. 7:603-616) or 



BNSDOCIO: <WO 01B3729A2 I > 



WO 0A9 PCT/EPO 1/04863 




-14- 



microinjection, into producer cells, such as packaging cells. After amplification 
in producer cells, the vectors that contain the heterologous DNA are introduced 
into selected target cells. 

As used herein, an expression or delivery vector refers to any plasmid or 
5 virus into which a foreign or heterologous DNA may be inserted for expression in 
a suitable host cell — i.e., the protein or polypeptide encoded by the DNA is 
synthesized in the host cell's system. Vectors capable of directing the 
expression of DNA segments (genes) encoding one or more proteins are referred 
to herein as "expression vectors." Also included are vectors that allow cloning 
10 of cDNA (complementary DNA) from mRNAs produced using reverse 
transcriptase. 

As used herein, a gene is a nucleic acid molecule whose nucleotide 
sequence encodes RNA or polypeptide. A gene can be either RNA or DNA. 
Genes may include regions preceding and following the coding region (leader and 

1 5 trailer) as well as intervening sequences (introns) between individual coding 
segments (exons). 

As used herein, tropism with reference to an adenovirus refers to the 
selective inf ectivity or binding that is conferred on the particle by the fiber 
protein, such as by the C-terminus portion that comprises the knob. 

20 As used herein, isolated with reference to a nucleic acid molecule or 

polypeptide or other biomolecule means that the nucleic acid or polypeptide has 
separated from the genetic environment from which the polypeptide or nucleic 
acid were obtained. It may also mean altered from the natural state. For 
example, a polynucleotide or a polypeptide naturally present in a living animal is 

25 not "isolated/ but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed 
herein. Thus, a polypeptide or polynucleotide produced and/or contained within 
a recombinant host cell is considered isolated. Also intended as an "isolated 
polypeptide" or an "isolated polynucleotide" are polypeptides or polynucleotides 

30 that have been purified, partially or substantially, from a recombinant host cell or 
from a native source. For example, a recombinantly produced version of a 
compound can be substantially purified by the one-step method described in 
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Smith and Johnson, Gene 57/31-40 (1988). The terms isolated and purified are 
sometimes used interchangeably. 

Thus, by "isolated" is meant that the nucleic acid is free of the coding 
sequences of those genes that, in the naturally-occurring genome of the 
5 organism (if any) immediately flank the gene encoding the nucleic acid of 

interest. Isolated DNA may be single-stranded or double-stranded, and may be 
genomic DNA, cDNA, recombinant hybrid DNA, or synthetic DNA. It may be 
identical to a native DNA sequence, or may differ from such sequence by the 
deletion, addition, or substitution of one or more nucleotides. 

Isolated or purified as it refers to preparations made from biological cells 
or hosts means any cell extract containing the indicated DNA or protein including 
a crude extract of the DNA or protein of interest. For example, in the t;ase of a 
protein, a purified preparation can be obtained following an individual technique 
or a series of preparative or biochemical techniques and the DNA or protein of 
interest can be present at various degrees of purity in these preparations. The 
procedures may include for example, but are not limited to, ammonium sulfate 
fractionation, gel filtration, ion exchange chromatography, affinity 
chromatography, density gradient centrifugation and electrophoresis. 

A preparation of DNA or protein that is "substantially pure" or "isolated" 
should be understood to mean a preparation free from naturally occurring 
materials with which such DNA or protein is normally associated in nature. 
"Essentially pure" should be understood to mean a "highly" purified preparation 
that contains at least 95% of the DNA or protein of interest. 

A cell extract that contains the DNA or protein of interest should be 
understood to mean a homogenate preparation or cell-free preparation obtained 
from cells that express the protein or contain the DNA of interest. The term 
"cell extract" is intended to include culture media, especially spent culture media 
from which the cells have been removed. 

As used herein, a packaging cell line is a cell line that provides a missing 
gene product or its equivalent. 

As used herein, an adenovirus viral particle is the minimal structural or 
functional unit of a virus. A virus can refer to a single particle, a stock of 
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particles or a viral genome. The adenovirus (Ad) particle is relatively complex 
and may be resolved into various substructures. 

As used herein, "penton" or "penton complex" are preferentially used 
herein to designate a complex of penton base and fiber. The term "penton" may 
5 also be used to indicate penton base, as well as penton complex. The meaning 
of the term "penton" alone should be clear from the context within which it is 
used. 

As used herein, a plasmid refers to an autonomous self-replicating 
extrachromosomal circular nucleic acid molecule, typically DNA. 

10 As used herein, a post-transcription regulatory element (PRE) is a 

regulatory element found in viral or cellular messenger RNA that is not spliced, 
i.e. introhless messages. Examples include, but are not limited to, human 
hepatitis virus, woodchuck hepatitis virus, the TK gene and mouse histone gene. 
The PRE may be placed before a polyA sequence and after a heterologous DNA 

1 5 sequence. 

As used herein, pseudotyping describes the production of adenoviral 
vectors having modified capsid protein or capsid proteins from a different 
serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 fiber protein. This may he 

20 accomplished by producing the adenoviral vector in packaging cell lines 
expressing different fiber proteins. 

As used herein, promoters of interest herein may be inducible or 
constitutive. Inducible promoters will initiate transcription only in the presence 
of an additional molecule; constitutive promoters do not require the presence of 

25 any additional molecule to regulate gene expression, a regulatable or inducible 
promoter may also be described as a promoter where the rate or extent of RNA 
polymerase binding and initiation is modulated by external stimuli. Such stimuli 
include, but are not limited to various compounds or compositions, light, heat, 
stress and chemical energy sources. Inducible, suppressible and repressive 

30 promoters are considered regulatable promoters. Preferred promoters herein, are 
promoters that are selectively expressed in ocular -cells, particularly 
photoreceptor cells. 
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As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
protein" may be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

5 As used herein, recombinant refers to any progeny formed as the result of 

genetic engineering. This may also be used to describe a virus formed by 
recombination of plasmids in a packaging cell. 

As used herein, a transgene or therapeutic nucleic acid molecule includes 
DNA and RNA molecules encoding an RNA or polypeptide. Such molecules may 
10 be "native" or naturally-derived sequences; they may also be "non-native" or 
"foreign" that are naturally- or recombinant ly-derived. The term "transgene," 
which may be used interchangeably herein with the term "therapeutic nucleic 
acid molecule," is often used to describe a heterologous or foreign (exogenous) 
gene that is carried by a viral vector and transduced into a host cell. 
1 5 Therefore, therapeutic nucleotide nucleic acid molecules include antisense 

sequences or nucleotide sequences which may be transcribed into antisense 
sequences. Therapeutic nucleotide sequences (or transgenes) all include nucleic 
acid molecules that function to produce a desired effect in the cell or cell nucleus 
into which said therapeutic sequences are delivered. For example, a therapeutic 
20 nucleic acid molecule can include a sequence of nucleotides that encodes a 
functional protein intended for delivery into a cell which is unable to produce 
that functional protein. 

As used herein, the vitreous of the eye refers to material that fills the 
chamber behind the lens of the eye (i.e., vitreous humor or vitreous body). 
25 As used herein, a promoter region refers to the portion of DNA of a gene 

that controls transcription of the DNA to which it is operatively linked. The 
promoter region includes specific sequences of DNA that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. This portion of the 
promoter region is referred to as the promoter. In addition, the promoter region 
30 includes sequences that modulate this recognition, binding and transcription 

initiation activity of the RNA polymerase. These sequences may bec/s acting or 



BNSDOCIO: <WO 0183729A2J_> 



WO 



PCT/EP01/04863 



-18- 

may be responsive to trans acting factors. Promoters, depending upon the 
nature of the regulation, may be constitutive or regulated. 

Thus, promoters are nucleic acid fragments that contain a DNA sequence 
that controls the expression of a gene located 3' or downstream of the 
5 promoter. The promoter is the DNA sequence to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene, 
typically located 3' of the promoter. A promoter also includes DNA sequences 
that direct the initiation of transcription, including those to which RNA 
polymerase specifically binds. If more than one nucleic acid sequence encoding a 

10 particular polypeptide or protein is included in a therapeutic viral vector or 
nucleotide sequence, more than one promoter or enhancer element may be 
included, particularly if that would enhance efficiency of expression. 

A regulatable or inducible promoter may be described as a promoter 
wherein the rate of RNA polymerase binding and initiation is modulated by 

15 external stimuli, (see, e.g., U.S. Patent Nos. 5,750,396 and 5,998,-205). Such 
stimuli include various compounds or compositions, light, heat, stress, chemical 
energy sources, and the like. Inducible, suppressive and repressive promoters 
are considered regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. 

20 Tissue-specific promoters direct the expression of the gene to which they are 
operably linked to a specific cell type. Tissue-specific promoters -cause the gene 
located 3' of it to be expressed predominantly, if not exclusively, in the specific 
cells where the promoter expressed its endogenous gene. Typically, it appears 
that if a tissue-specific promoter expresses the gene located 3' of it at all, then 

25 it is expressed appropriately in the correct cell types (see, e.g., Palmiter et al. 
(1986) Ann. Rev. Genet. 20: 465-499). 

As used herein, the phrase "operatively linked" generally means the 
sequences or segments have been covalently joined into one piece of DNA, 
whether in single or double stranded form, whereby control sequences on one 

30 segment control expression or replication or other such control of other 
segments. The two segments are not necessarily contiguous. 
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As used herein, exogenous encompasses any therapeutic composition 
that is administered by the therapeutic methods provided herein. Thus, 
exogenous may also be referred to herein as foreign, or non-native or other 
equivalent expression. 
5 B. Ad37 fiber tropism 

The adenovirus fiber protein is a major determinant of adenovirus tropism 
(Gall eta/. (1996) J. Virol. 70:2116-2123; Stevenson etal. (1995) J. Virol. 
65:2850-2857). The fiber protein extends from the capsid and mediates viral 
binding to the cell surface by binding to specific cell receptors (Philipson et al. 
10 (1968) J. Virol. 2:1064-1075). The fiber is a trimeric protein that includes an N- 
terminal tail domain that interacts with the adenovirus penton base, a central 
shaft domain of varying length, and a C-terminal knob domain that contains the 
cell receptor binding site (Chroboczek et al. (1995) Curr.Top. Microbiol. Immunol. 
755:163-200; Riurok etal. (1990) J.Mol.Biol. 275:589-596; Stevenson etal. 
15 (1995) J. Virol. 65:2850-2857). Fiber proteins of most adenovirus subgroups 
have been shown to bind specifically or selectively to the 46 kDa coxsackievirus- 
adenovirus receptor (CAR), (Bergelson et al. (1997) Science 275:1320-1323; 
Roelvink etal. (1998) J. Virol. 72:7909-7915). CAR appears to be expressed in 
a variety of human tissues, including the lung, at various levels (Bergelson ~et al. 
20 (1997) Science 275:1320-1323), but Ad37 binds poorly to lung epithelial cells 
(Huang etal. (1999) J. Virol. 75:2798-2802). This suggests that the tropism of 
this serotype may be influenced by factors independent of CAR expression. 

Structural and biochemical data also suggest that distinct receptor binding 
sites are located on different regions of the Ad5 and Ad37 fiber knobs. 
25 Adopting the nomenclature of Xia et al. (Xia etal. (1994) Structure 2:1259- 

1 270), the receptor binding site for Ad5 is located at the AB-loop on the side of 
the fiber knob (Bewley et at. (1999) Science 256:1579-1583; Roelvink etal. 
(1 999) Science 286:1 568-1 571 ). It is known that a lysine residue at position 
240 of the Ad37 fiber, located in the CD-loop, is important for receptor binding 
30 (Huang etal. (1999) J. Virol. 73:2798-2802). The co-crystal structure of the 
Ad1 2 knob and the N-terminal domain of CAR (Bewley etal. (1999) Science 
266:1579-1583) show that the CD-loop does not contact CAR. It thus appears 
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that different regions of the Ad5 and Ad37 fiber knobs recognize distinct cell 
receptors. 

A 46 kDa receptor for coxsackieviruses and adenoviruses 4CAR) mediates 
attachment for many adenovirus serotypes. The wide distribution of CAR fails 
to explain why certain adenovirus serotypes (i.e. Ad37) are highly associated 
with severe ocular infections such as epidemic keratoconjunctivitis <EKC). Ad37 
does not use CAR, but instead uses a glycoprotein that contains sialic acid as its 
primary receiptor (Arnberg etal. ((2000) J. Virol. 74:42-48). The modest 
number of Ad37 binding sites per cell (Huang eta/. (1999) J. Virol. 73:2798- 
2802) also suggests that Ad37 recognizes a specific glycoprotein as its primary 
receptor for binding to conjunctival cells. 

Adenovirus type 37 (subgroup D) has been associated with infections of 
the eye and genital tract. The tropism of Ad37 derives from the binding 
preference of its fiber protein, which binds to a receptor located on the surface 
of cells including Chang C, conjunctival epithelial cell line (Huang etal. (1999) J. 
Virology 73:2798-2802). 

A protein receptor that is preferentially expressed on conjunctival cells to 
which Ad37 fiber binds is shown herein. The preferential expression of the 
Ad37 receptor protein on conjunctival cells suggests that this receptor likely 
influences Ad37 tropism and should play a key role in ocular pathogenesis. It is 
shown herein that Ad37 uses a distinct protein receptor that is selectively 
expressed on conjunctival cells. It is shown that Ad37 binds well to conjunctival 
cells (Chang C), but poorly to lung carcinoma cells (A549). To determine if 
infection correlated with cell binding, an Ad5 vector containing the Ad37 fiber 
protein was constructed. The 'pseudotyped' vector delivered transgenes to 
Chang C cells better than to A549 cells. Ad37 binding was abolished by 
protease treatment of Chang C cells, indicating the receptor is a membrane 
protein. Ad37 binding to conjunctival cells is shown herein to be calcium- 
dependent. It is also shown that Ad37 infection was not inhibited by a function- 
blocking anti-CAR monoclonal antibody, which is a feature distinct from Ad5 
fiber interaction with CAR. Using a virus overlay protein blot assay (VOPBA), 
calcium-dependent Ad37 binding to a 50 KDa membrane protein on Chang C 
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cells, but not A549 cells was detected. Ad19p a closely related serotype that 
fails to bind to conjunctival cells, does not recognize the SO kDa protein. 
Together, these data indicate that the 50 kDa protein is a candidate receptor for 
Ad37 on conjunctival cells. 



vector to the vitreous humor, the recombinant adenovirus with the Ad37 fiber 
preferentially and selectively binds to photoreceptor cells. Hence, a recombinant 
adenoviral dielivery vehicle that has an Ad37 fiber protein can serve as a vector 
for delivery of therapeutic agents to the eye for treatment of ocular disorders, 
10 including genetic and acquired disorders. The identification of the receptor for 
Ad37 and the resulting recognition of Ad37 tropism allows targeting of 
adenovirus vectors to specific human ocular cells. 

As noted, fiber plays a crucial role in adenovirus infection by attaching 
the virus to a specific receptor on a cell surface. Hexon, penton and fiber 

15 capsomeres are the major components on the surface of the virion. The fiber is 
an elongated protein which exists as a trimer of three identical polypeptides 
(polypeptide IV) of 582 amino acids in length. An adenovirus fiber includes 
three domains: an N-terminal tail domain that interacts with penton base; a 
shaft composed of variable numbers of repeats of a 15-amino-acid segment that 

20 forms beta-sheet and beta-bends; and a knob at the C-terminus ("head domain") 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. The gene encoding the fiber protein from Ad2 has been 
expressed in human cells and has been shown to be correctly assembled into 
trimers, glycosylated and transported to the nucleus (see, -e.g., Hong and Engler, 

25 Virology 185: 758-761, 1991). Thus, alteration of the fiber in recombinant Ad 
vectors can lead to alteration in gene delivery. 

As shown herein, alteration of fiber in recombinant Ad vectors such that 
the fiber is derived from Ad37 or another adenovirus serotype D, provides a 
means for selective delivery of a recombinant virus to particular cells in the jeye, 

30 including conjunctival cells, and most significantly photoreceptors, thereby 
providing a means for targeted delivery to photoreceptor cells. 



5 



Significantly, it is also shown herein that, upon administration of the 
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Photoreceptor cells are implicated in a number of hereditary and acquired 
retinal degenerative disorders. In addition, photoreceptor cells are located such 
that products produced therein can be delivered to other areas of the eye by 
virtue of the blood flow in the vicinity of the photoreceptor cells and also by 
5 virtue of the proximity of the photoreceptors to the retinal pigmented epithelium 
(RPE) and other retinal cells. 

Hence it is contemplated herein that the recombinant viral vector will 
include a packaged recombinant adenovirus genome containing at least the 
minimal elements for replication and packaging; heterologous DNA encoding a 

10 desired gene product, typically a therapeutic product or plurality of products, 
such as several trophic factors, whose combined activity is effective for treating 
a disorder, such as a retinal degenerative disorder; and the resulting virion 
particles will include a fiber that has a sufficient portion to confer specific 
targeting to photoreceptor cells when the recombinant viral particles are 

15 introduced into the aqueous humor of a mammalian, preferably a human, eye, or 
otherwise contacted with the photoreceptor cells. The fiber may be a chimeric 
protein that has been modified for effective interaction with other coat structural 
proteins, such as penton. In addition, the fiber may be modified to include other 
elements that alter its tropism to permit binding to other cells as well (see, e.g., 

20 U.S. Patent Nos. 5,756,086 and 5,543,328, International PCT application No. 
WO 95/26412 and WO 98/44121 and Krasnykh, eta/. {J. Virol. 70:^839-46, 
1996). 

C. Construction of the viral particles 

1. Selection of viral genome and fiber protein 

25 Methods for preparing recombinant adenoviral vectors for gene 

product delivery are well known. Preferred among those are the methods 
exemplified herein (see EXAMPLES) and also described in copending U.S. 
application Serial No. 09/482,682 (also filed as International PCT application 
No. PCT/US00/00265, filed January 14, 2000, which claims priority to U.S. 

30 provisional application Serial No. 60/1 15,920, as does U.S. application Serial 
No. 09/482,682)). 
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As noted, any desired recombinant adenovirus is contemplated for use in 
the methods herein as long as the viral genome is packaged in a <capsid that 
includes at least the portion of a fiber protein that provides selective binding to 
photoreceptor cells. This fiber protein is preferably from an adenovirus type D 
5 serotype and is preferably an Ad37 fiber. The fiber protein should retain the 
knob region at the C-terminus ("head domain") from the Ad virus of subgroup D 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. Hence the fiber protein can be a chimeric fiber protein as long 
as it retains a sufficient portion of the type D serotype to specifically or 
10 selectively bind to photoreceptor cells. Generally the portion retained will be all 
or a portion of the knob region. The precise amount of knob region required can 
be determined empirically by including portions thereof and identifying the 1 
minimum residues from and Ad type D serotype, preferably Ad37, to effect 
selective targeting of a virion packaged with such fiber to photoreceptors in the 

15 eye upon introduction of the packaged virion into the aqueous humor. 

Recombinant adenovirus containing heterologous nucleic acids that 
encode a desired product, such a gene to correct a genetic defect, may be made 
by any methods known to those of skill in the art. The viruses must be 
packaged in a cell line that results in expression of fiber on the particles that 

20 specifically, electively or preferentially targets (binds and results in 

internalization) the viral particle to cells in the eye. The fiber protein from Ad37 
and other Adenoviruses of serotype D that infect the eye effects such targeting. 
The resulting adenovirus particles that express such fiber is administered by 
intraocular injection, subretinal injection, particularly intravitreal injection, or any 

25 means that results in preferential accumulation in photoreceptor cells. 

The family of Adenoviridae includes many members with at teast 47 
known serotypes of human adenovirus (Ad1-Ad47) (Shenk, Virology, Chapter 
67, in Fields et at., eds. Lippincott-Raven, Philadelphia, 1996,) as well as 
members of the genus Mastadenovirus including human, simian, bovine, equine, 

30 porcine, ovine, canine and opossum viruses and members of the Aviadenovirus 
genus, including bird viruses, such as CELO. 
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Thus it is contemplated that the methods herein can be applied to any 
recombinant viral vectors derived from any adenovirus species. One of skill in 
the art would have knowledge of the different adenoviruses (see, e.flr.,Shenk, 
Virology, Chapter 67 , in Fields et al.+ eds. LippincottTRaven,. Philadelphia, 1996.,) 
and can construct recombinant viruses containing portions of the genome of any 
such virus. 

In the exemplified embodiment, viral particles with Ad37 fiber were 
prepared. Site-directed mutations were made to the Ad37 fiber gene to make 
the tail sequence more closely match that of Ad5 to facilitate Ad37 fiber binding 
to the Ad5 penton base. The plasmid for the expression of the Ad37 fiber 
protein, pDV80, contains the CMV promoter, the adenovirus type 5 tripartite 
leader (TPL), and the modified Ad37 fiber gene sequence. Genes of interest, 
such as nucleic acid encoding the /?subunit of cGMP phosphodiesterase (0PDE), 
^-glucuronidase, rhodopsin, growth factors, anti-cancer agents, growth factor 
receptors and other anti-angiogenic agents, and anti-apoptotic agents, can be 
incorporated into these vectors using the methods known to those of skill in the 
art and exemplified herein. 

Known adenovirus vectors, previously constructed for intraocular therapy 
(see, e.g., Bennett et aL (1996) Nature Medicine 2:649-654, which provides an 
Ad virus encoding /?PDE for treatment of retinitis pigmentosa; Cayouette et aL 
(1998) Human Gene Therapy 5:423-430, which provides an Ad vector that 
expresses CNTF for treatment of retinitis pigmentosa and other retinal 
degenerative diseases; and Li eta/. (1995) Proc. Natl. Acad. Sci. U.S.A. 
52:7700-7704, which provides an Ad virus vector that encodes a human /?- 
glucuronidase for treatment of lysosomal storage disease caused by /?- 
glucuronidase deficiency) can be modified by repackaging the recombinant 
genome using a packaging line that expresses an Ad37 fiber or other O serotype 
fiber. 

For exemplification, nucleic acid encoding GFP was incorporated into 
these vectors as a means to visualize their localization. Other genes, such as 
genes that encode therapeutic products, my be included in place of or in addition 
to GFP. 
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Plasmid pDV80 was electroporated into E1-2a S8 cells and stable lines 
were selected. The fiber-deleted vectors Ad5./?gal.AF and Ad5.GFP.AF were 
grown in cells in a resulting cell line, designated 705, to produce virions, which 
express the Ad37 fiber (Ad5.£gal.AF/37F and Ad5.GFP;AF/37F) and CsCI- 
5 purified. These virions selectively transduce photoreceptor cells when injected 
intraocularly into the vitreous humor. 
2. Packaging 

Recombinant adenoviral vectors generally have at least a deletion in the 
first viral early gene region, referred to as E1 , which includes the E1a and E1b 

10 regions. Deletion of the viral E1 region renders the recombinant adenovirus 

defective for replication and incapable of producing infectious viral particles in 
subsequently-infected target cells. Thus, to generate E1 -deleted adenovirus 
genome replication and to produce virus particles requires a system of 
complementation which provides the missing E1 gene product. E1 

15 complementation is typically provided by a cell line expressing €1, such as the 
human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 
293. Cell line 293 contains the E1 region of adenovirus, which provides £1 
gene region products to "support" the growth of E1 -deleted virus in the cell line 
(see, e.g., Graham etaL, J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 

20 that may be usable for production of defective adenovirus having a portion of 
the adenovirus E4 region have been reported (WO 96/22378). 

Multiply deficient adenoviral vectors and complementing cell lines have 
also been described (WO 95/34671, U.S. Patent No. 5,994,106). 

Copending U.S. application Serial No. 09/482,682 (also filed as 

25 International PCT application No. PCT/USOO/00265, filed January 14, 2000)) 
provides packaging cell lines that support viral vectors with deletions of major 
portions of the viral genome, without the need for helper viruses and also 
provides cell lines and helper viruses for use with helper-dependent vectors. 
The packaging cell line has heterologous DNA stably integrated into the 

30 chromosomes of the cellular genome. The heterologous DNA sequence encodes 
one or more adenovirus regulatory and/or structural polypeptides that 
complement the genes deleted or mutated in the adenovirus vector genome to 
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be replicated and packaged. The packaging cell line express, for example, one 
or more adenovirus structural proteins, polypeptides, or fragments thereof, such 
as penton base, hexon, fiber, polypeptide Ilia, polypeptide V, polypeptide VI, 
polypeptide VII, polypeptide VIII, and biologically active fragments thereof. The 
5 expression can be constitutive or under the control of a regulatable promoter. 
These cell lines are designed for expression of recombinant adenoviruses 
intended for delivery of therapeutic products. 

Particular packaging cell lines complement viral vectors having a deletion 
or mutation of a DNA sequence encoding an adenovirus structural protein, 
10 regulatory polypeptides E1A and E1B, and/or one or more of the following 

regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 

The packaging cell lines are produced by introducing -each DNA molecule 
into the cells and then into the genome via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
15 introduced via a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 
such as Ad 37, polypeptide or fragment thereof. 

For therapeutic applications, the delivery plasmid further includes a 
20 nucleotide sequence encoding a foreign polypeptide. "Exemplary delivery 

plasmids include, but are not limited to, pDV44, pAE1Bj?-gal and pA£1sp1B. In 
a similar or analogous manner, therapeutic genes may be introduced. 

The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
25 chromosome(s) of the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and 
second delivery plasmids wherein a first delivery plasmid comprises an 
adenovirus genome lacking a nucleotide sequence encoding fiber and incapable 
of directing the packaging of new viral particles in the absence of a second 
30 delivery plasmid, and a second delivery plasmid comprises an adenoviral genome 
capable of directing the packaging of new viral particles in the presence of the 
first delivery plasmid. 
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ln a variation, the packaging cell line expresses fiber protein or chimeric 
variant thereof from an Ad virus of subgroup D, preferably Ad37, serotype or it 
can be any fiber protein but one that has been modified to include the portion of 
the Ad virus of subgroup D, such as Ad37, responsible for selective tar.geting. to 
5 photoreceptors upon introduction into the vitreous humor of the eye of a 
mammal, preferably a human. The fiber protein can be further modified to 
include a non-native amino acid residue sequence that targets additional specific 
receptors. In all instances, the modification should not disrupt trimer formation 
or transport of fiber into the nucleus. In another variation, the non-native amino 
10 acid residue sequence alters the binding specificity of the fiber for a targeted cell 
type. The structural protein is fiber can include amino acid residue sequences 
from more than one adenovirus serotype. The nucleotide sequences encoding 
fiber protein or polypeptide need not be modified solely at one or both termini; 
fiber protein, may be modified "internally" as well as at the termini. 
15 Additional nucleic acid fragments can encode polypeptides that are added 

to the fiber protein. In one variation, the non-native amino acid residue 
sequence is coupled to the carboxyl terminus of the fiber. In another, the non- 
native amino acid residue sequence further includes a linker sequence. 
Alternatively, the fiber protein further comprises a ligand coupled to the linker. 
20 Suitable ligands include, but are not limited to, ligands that specifically or 

* selectively bind to a cell surface receptor and ligands that can be used to -couple 
other proteins or nucleic acid molecules. Typically, the packaging cell lines will 
contain nucleic acid encoding the fiber protein or modified protein stably 
integrated into a chromosome or chromosomes in the cellular genome. 
25 The packaging cell line can be derived from a procaryotic cell line or from 

a eukaryotic cell line. While various embodiments suggest the use of mammalian 
cells, and more particularly, epithelial cell lines, a variety of other, non-epithelial 
cell lines are used in various embodiments. Thus, while various embodiments 
disclose the use of a cell line selected from among the 293, A549, W162, HeLa, 
30 Vero, 21 1, and 21 1A cell lines, and any other cell lines suitable for such use are 
likewise contemplated herein. 

3. Components of the nucleic acid molecule included in the particle 
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A recombinant viral vector or therapeutic viral vector forojse in the 
methods herein, typically includes a nucleic acid fragment that encodes a 
protein or polypeptide molecule, or a biologically active fragment thereof, or 
other regulatory sequence, that is intended for use. in. therapeutic applications. 
5 The nucleic acid molecule to be packaged in the viral particle also may 

include an enhancer element and/or a promoter located 3' or 5' to and 
controlling the expression of the therapeutic product-encoding nucleic acid 
molecule if the product is a protein. Further, for purposes herein, the promoter 
and/or other transcriptional and translational regulatory sequences controlling 

10 expression of the product is preferably one that is expressed specifically in the 
targeted cells, such as the a photoreceptor-specific promoter, such as a 
rhodopsin gene promoter. 

The nucleic acid molecule to be packaged in viral capsid includes at least 
2 different operatively linked DNA segments. The DNAcan be manipulated and 

15 amplified by PCR as described herein and by using standard techniques, such as 
those described in Molecular Cloning: A Laboratory Manual, 2nd Ed., Sambrook 
etaL, eds., Cold Spring Harbor, New York (1989). Typically, to produce such 
molecule, the sequence encoding the selected polypeptide and the promoter or 
enhancer are operatively linked to a DNA molecule capable of autonomous 

20 replication in a cell either in vivo or in vitro. By operatively linking the enhancer 
element or promoter and nucleic acid molecule to the vector, the attached 
segments are replicated along with the vector sequences. 

Thus, the recombinant DNA molecule (rDNA) is a hybrid DNA molecule 
comprising at least 2 nucleotide sequences not normally found together in 

25 nature. In various preferred embodiments, one of the sequences is a sequence 
encoding an Ad-derived polypeptide, protein, or fragment thereof. The nucleic 
acid molecule intended to be packaged is from about 20 base pairs to about 
40,000 base pairs in length, preferably about 50 bp to about 38,000 bp in 
length. In various embodiments, the nucleic acid molecule is of sufficient length 

30 to encode one or more adenovirus proteins or functional polypeptide portions 
thereof. Since individual Ad polypeptides vary in length from about 1 9 amino 
acid residues to about 967 amino acid residues, encoding nucleic acid molecules 



8NSOOCID: «WO 01637g9A2_l_> 




PCT/EP01AW863 



-29- 

from about 50 bp up to about 3000 bp, depending on the number and size of 
individual polypeptide-encoding sequences that are "replaced" in the viral 
vectors by therapeutic product-encoding nucleic acid molecules. 

Preferably the molecule includes an adenovirus tripartite leader (TPL) 
5 nucleic acid sequence operatively linked to an intron containing RNA processing 
signals (such as for example, splice donor or splice acceptor sites) suitable for 
expression in the packaging cell line. Most preferably the intron contains a splice 
donor site and a splice acceptor site. Alternatively, the TPL nucleotide sequence 
may not comprise an intron. The intron includes any sequence of 
10 nucleotides that function in the packaging cell line to provide RNA processing 
signals, including splicing signals. Introns have been well characterized from a 
large number of structural genes, and include but are not limited to a native 
intron 1 from adenovirus, such as Ad5's TPL intron 1; others include the SV40 
VP intron; the rabbit beta-globin intron, and synthetic intron constructs tsee, 
15 e.g., Petitclerc eta/. (1995U BiothechnoL, 40:169; and Choi eta/. (19910 MoL 
Cell. Biol., 7/:3070). 

The nucleic acid molecule encoding the TPL includes either (a) first and 
second TPL exons or (b) first, second and third TPL excns, where each TPL exon 
in the sequence is selected from among the complete TPL exon 1 , partial TPL 
20 exon 1 , complete TPL exon 2 and complete TPL exon 3. A complete exon is 
one which contains the complete nucleic acid sequence based on the sequence 
found in the wild type viral genome. Preferably the TPL exons are from Ad2, 
Ad3, Ad5, Ad7 and the like, however, they may come from any Ad serotype, as 
described herein. A preferred partial TPL exon 1 is described in the Examples. 
25 The use of a TPL with a partial exon 1 has been reported (International PCT 
application No. WO 98/13499). 

The intron and the TPL exons can be operatively linked in a variety of 
configurations to provide a functional TPL nucleotide sequence. An intron may 
not be a part of the construct. For example, the intron can be positioned 
30 between any of TPL exons 1 , 2 or 3, and the exons can be in any order of first 
and second, or first/second/third. The intron can also be placed preceding the 
first TPL exon or following the last TPL exon. In a preferred embodiment, 
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complete TPL exon 1 is operatively linked to complete TPL exon 2 operatively 
linked to complete TPL exon 3. In a preferred variation, adenovirus TPL intron 1 
is positioned between complete TPL exon 1 and complete TPL exon 2. It may 
also be possible to use analogous translational regulators from other viral 
5 systems such as rabiesvirus. 

A preferred "complete" TPL nucleic acid molecule containing complete 
TPL exons 1 , 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2 
has a nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL 
nucleic acid molecule containing partial TPL exon 1 and complete TPL exons 2 
10 and 3 in that order has a nucleotide sequence shown in SEQ ID NO: 26. The 
construction of these preferred TPL nucleotide sequences is described in the 
Examples. 

Thus, preferred expression cassettes and complementing plasmids for 
expressing adenovirus structural genes, particularly fiber protein, contain an 
1 5 adenovirus TPL nucleotide sequence as described herein. 
4. Complementing Plasmids 

Also contemplated are the use of nucleic acid molecules, typically in the 
form of DNA plasmid vectors, which are capable of expression of an adenovirus 
structural protein or regulatory protein. Because these expression plasmids are 
20 used to complement the defective genes of a recombinant adenovirus vector 
genome, the plasmids are referred to as complementing or complementation 
plasmids. 

The complementing plasmid contains an expression cassette, a nucleotide 
sequence capable of expressing a protein product encoded by the nucleic acid 

25 molecule. Expression cassettes typically contain a promoter and a structural 

gene operatively linked to the promoter. The complementing plasmid can further 
include a sequence of nucleotides encoding TPL nucleotide to enhance 
expression of the structural gene product when used in the context of 
adenovirus genome replication and packaging. 

30 A complementing plasmid can include a promoter operatively linked to a 

sequence of nucleotides encoding an adenovirus structural polypeptide, such as, 
but are not limited to, penton base; hexon; fiber; polypeptide Ilia; polypeptide V; 
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polypeptide VI; polypeptide VII; polypeptide VIII; and biologically active 
fragments thereof. In another variation, a complementing plasmid may also 
include a sequence of nucleotides encoding a first adenovirus regulatory 
polypeptide, a second regulatory polypeptide, and/or a third regulatory 
5 polypeptide, and any combination of the foregoing. 

Plasmid pDV80 is a preferred plasmid herein. Other plasmids constructed 
in an analogous manner to encode modified fiber proteins and chimeric fiber 
proteins are also contemplated herein. 



prepared using any suitable method, such as the phosphotriester or 
phosphodiester methods (see, e.g., Narang (1979) etal., Meth. EnzymoL, 
68:90; U.S. Patent No. 4,356,270; and Brown et a/., (1979) Meth. Enzymol., 
68:109). For oligonucleotides, the synthesis of the family members can be 

15 conducted simultaneously in a single reaction vessel, or can be synthesized 
independently and later admixed in preselected molar ratios. For simultaneous 
synthesis, the nucleotide residues that are conserved at preselected positions of 
the sequence of the family member can be introduced in a chemical synthesis 
protocol simultaneously to the variants by the addition of a single preselected 

20 nucleotide precursor to the solid phase oligonucleotide reaction admixture when 
that position number of the oligonucleotide is being chemically added to the 
growing oligonucleotide polymer. The addition of nucleotide residues to those 
positions in the sequence that vary can be introduced simultaneously by the 
addition of amounts, preferably equimolar amounts, of multiple preselected 

25 nucleotide precursors to the solid phase oligonucleotide reaction admixture 
during chemical synthesis. For example, where all four possible natural 
nucleotides (A,T,G and C) are to be added at a preselected position, their 
precursors are added to the oligonucleotide synthesis reaction at that step to 
simultaneously form four variants (see, e.g., Ausubel etal. (Current Protocols in 



30 Molecular Biology, Suppl. 8. p.2.1 1 .7, John Wiley & Sons, Inc., New York 
,1991). 



5. Nucleic Acid Molecule Synthesis 

A nucleic acid molecule comprising synthetic oligonucleotides can be 
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Nucleotide bases other than the common four nucleotides (A,T,*G or C), or 
the RNA equivalent nucleotide uracil (U), can also be used. For example, it is 
well known that inosine (I) is capable of hybridizing with A f T and G, but not C. 
Examples of other useful nucleotide analogs are known "m the art and may be 
5 found referred to in 37 C.F.R. §1.822. 

Thus, where all four common nucleotides are to occupy a single position 
of a family of oligonucleotides, that is, where the preselected nucleotide 
sequence is designed to contain oligonucleotides that can hybridize to four 
sequences that vary at one position, several different oligonucleotide structures 

10 are contemplated. The composition can contain four members, where a 
preselected position contains A,T,G or C. Alternatively, a composition can 
contain two nucleotide sequence members, where a preselected position 
contains I or C, and has the capacity to hybridize at that position to all four 
possible common nucleotides. Finally, other nucleotides may be included at the 

1 5 preselected position that have the capacity to hybridize in a non-destabilizing 
manner with more than one of the common nucleotides in a manner similar to 
inosine. 

Similarly, larger nucleic acid molecules can be constructed in synthetic 
oligonucleotide pieces, and assembled by complementary hybridization and 
20 ligation, as is well known. 

D. Adenovirus Expression Vector Systems 

The adenovirus vector genome that is encapsulated in the virus particle 
and that expresses exogenous genes in a gene therapy setting is a key 
component of the system. Thus, the components of a recombinant adenovirus 
25 vector genome include the ability to express selected adenovirus structural 
genes, to express a desired exogenous protein, and to contain 

sufficient replication and packaging signals that the genome is 
packaged into a gene delivery vector particle. The preferred 
replication signal is an adenovirus inverted terminal repeat 
30 containing an adenovirus origin of replication, as is well known 

and described herein. 
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Although adenovirus include many proteins, not all adenovirus proteins 
are required for assembly of a recombinant adenovirus particle (vector). Thus, 
deletion of the appropriate genes from a recombinant Ad vector permits 
accommodation of even larger "foreign" DNA segments- 
5 A preferred recombinant adenovirus vector genome is "helper 

independent" so that genome can replicate and be packaged without the help of 
a second, complementing helper virus. Complementation is provided by a 
packaging cell. 

In a preferred embodiment, the adenovirus vector genome does not 
10 encode a functional adenovirus fiber protein. A non-functional fiber gene refers 
to a deletion, mutation or other modification to the adenovirus fiber gene such 
that the gene does not express any or insufficient adenovirus fiber protein to 
package a fiber-containing adenovirus particle without -complementation of the 
fiber gene by a complementing piasmid or packaging cell line. Such a genome is 
15 referred to as a "fiberless" genome, not to be confused with a fiberless particle. 
Alternatively, a fiber protein may be encoded but is insufficiently expressed to 
result in a fiber containing particle. 

Thus, contemplated for use are helper-independent fiberless recombinant 
adenovirus vector genomes that include genes that (a) express all adenovirus 
20 structural gene products but express insufficient adenovirus fiber protein to 

package a fiber-containing adenovirus particle without complementation of said 
fiber gene, (b) express an exogenous protein, and (c) contain an adenovirus 
packaging signal and inverted terminal repeats containing adenovirus origin of 
replication. 

25 The adenovirus vector genome is propagated in the laboratory in the form 

of rDIMA plasmids containing the genome, and upon introduction into an 
appropriate host, the viral genetic elements provide for viral genome replication 
and packaging rather than plasmid-based propagation. Exemplary methods for 
preparing an Ad-vector genome are described in the Examples. 

30 A vector herein includes a nucleic acid (preferably DNA) molecule capable 

of autonomous replication in a cell and to which a DNA segment, e.g., a gene or 
polynucleotide, can be operatively linked to bring about replication of the 
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attached segment. For purposes herein, one of the nucleotide segments to be 
operatively linked to vector sequences encodes at least a portion of a therapeutic 
nucleic acid molecule. As noted above, therapeutic nucleic acid molecules 
include those encoding proteins and also those that encode regulatory factors 
5 that can lead to expression or inhibition or alteration of expression of a gene 
product in a targeted cell. 

1 . Nucleic Acid Gene Expression Cassettes 
In various embodiments, a peptide-coding sequence of the therapeutic 
gene is inserted into an expression vector and expressed; however, it is also 

10 feasible to construct an expression vector which also includes some non-coding 
sequences as well. Preferably, however, non-coding sequences are excluded. 
Alternatively, a nucleotide sequence for a soluble form of a polypeptide may be 
utilized. Another preferred therapeutic viral vector includes a nucleotide 
sequence encoding at least a portion of a therapeutic nucleotide sequence 

15 operatively linked to the expression vector for expression of the coding sequence 
in the therapeutic nucleotide sequence. 

The choice of viral vector into which a therapeutic nucleic acid molecule 
is operatively linked depends directly, as is well known in the art, on the 
functional properties desired, e.g., vector replication and protein expression, and 

20 the host cell to be transformed - these being limitations inherent in the art of 
constructing recombinant DNA molecules. Although certain adenovirus 
serotypes are recited herein in the form of specific examples, it should be 
understood that the use of any adenovirus serotype, including hybrids and 
derivatives thereof are contemplated. 

25 A translatable nucleotide sequence is a linear series of nucleotides that 

provide an uninterrupted series of at Jeast 8 codons that encode a polypeptide in 
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The 
vector itself may be of any suitable type, such as a viral vector (RNA or DNA), 
naked straight-chain or circular DNA, or a vesicle or envelope containing the 

30 nucleic acid material and any polypeptides that are to be inserted into the -cell. 
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2. Promoters 

As noted elsewhere herein, an expression nucleic acid in an Ad-derived 
vector may also include a promoter, particularly a tissue or cell specific 
promoter, preferably one expressed in ocular cells, particularly photoreceptors, 
5 Promoters contemplaged for use herein include regulatable (inducible) as 

well as constitutive promoters, which may be used, either on separate vectors or 
on the same vector. Some useful regulatable promoters are those of the CREB- 
regulated gene family and include inhibin, gonadotropin, cytochrome c, 
glucagon, and the like. (See, e.g., International PCT application No. WO 
10 96/14061). Preferably the promoter selected is from a photoreceptor-specific 
gene, such as a rhodopsin gene or gene that encodes a protein that regulates 
rhodopsin expression. 
E. Formulation and administration 

Compositions containing therapeutically effective concentrations of 
15 recombinant adenovirus delivery vectors are provided. These are for delivery of 
therapeutic gene products to cells, particularly cells express a particular 50 kDa 
receptor or other receptor with which the vectors interact. These cells include 
cells of the eye and genital tract. Of particular interest are photoreceptor cells of 
the eye. Administration is effected by any means through which contacting with 
20 the photoreceptors is effected. Preferable modes of administration include, but 
are not limited to, subretinal injection, particularly intravitreal injection, to provide 
access to photoreceptor cells. 

The recombinant viral compositions may also be formulated for 
implantation into the anterior or posterior chamber of the eye, preferably the 
25 vitreous cavity, in sustained released formulations, such as those adsorbed to 
biodegradable supports, including collagen sponges, or in liposomes. Sustained 
release formulations may be formulated for multiple dosage administration, so 
that during a selected period of time, such as a month or up to about a year, 
several dosages are administered. Thus, for example, liposomes may be 
30 prepared such that a total of about two to up to about five or more times the 
single dosage is administered in one injection. 
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The vectors are formulated in an ophthalmologically acceptable carrier for 
intraocular, preferably intravitreal, administration in a volume of between about 
0.05 ml and 0.150 ml, preferably about 0.05 and 0.100 ml. 

The composition can be provided in a sealed sterile vial containing an 
5 amount of a compound of formula I, that upon intraocular administration will 
deliver a sufficient amount of viral particles to the photoreceptors in a volume of 
about 50 to 150 //I, containing at least about 10 7 , more preferably at least about 
10 8 plaque forming units in such volume. Typically, the vials will, thus, contain 
about 0.150 ml of the composition. 
10 To prepare compositions the viral particles are dialzyed into a suitable 

ophthalmologically acceptable carrier or viral particles, for example, may be 
concentrated and/or mixed therewith. The resulting mixture may be a solution, 
suspension or emulsion. In addition, the viral particles may be formulated as 
the sole pharmaceutical^ active ingredient in the composition or may be 

1 5 combined with other active agents for the particular disorder treated. 

For administration by intraocular injection or via eyedrops, suitable 
carriers include, but are not limited to, physiological saline, phosphate buffered 
saline (PBS), balanced salt solution (BSS),. lactate Ringers solution, and solutions 
containing thickening and solubilizing agents, such as glucose, polyethylene 

20 glycol, and polypropylene glycol and mixtures thereof. Liposomal suspensions 
may also be suitable as pharmaceutical^ acceptable carriers. These may be 
prepared according to methods known to those skilled in the art. Suitable 
ophthalmologically acceptable carriers are known. Solutions or mixtures 
intended for ophthalmic use may be formulated as 0.01% - 10% isotonic 

25 solutions, pH about 5-7, with appropriate salts [see, e.g.. U.S. Patent No. 
5,1 16,868, which describes typical compositions of ophthalmic irrigation 
solutions and solutions for local application]. Such solutions, which have a pH 
adjusted to about 7.4, contain, for example, 90-100 mM sodium chloride, 4^6 
idM dibasic potassium phosphate, 4-6 mM dibasic sodium phosphate, 8-1 2 mM 

30 sodium citrate, 0.5-1.5 mM magnesium chloride, 1 .6-2.5 mM calcium chloride, 
15-25 mM sodium acetate, 10-20 mM D.L.-sodium £-hydroxybutyrate and 5-^5:5 
mM glucose. 
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The compositions may be prepared with carriers that protect them from 
rapid elimination from the body, such as time release formulations or coatings. 
Such carriers include controlled release formulations, such as, but not limited to, 
microencapsulated delivery systems, and biodegradable,, biocompatible polymers, 
5 such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 

polyorthoesters, polylactic acid and other types of implants that may be placed 
directly into the anterior or posterior chamber or vitreous cavity of the eye. The 
compositions may also be administered in pellets, such as Elvax pellets 
(ethylene-vinyl acetate copolymer resin). 
10 Liposomal suspensions, including tissue-targeted liposomes, may also be 

suitable as pharmaceutical^ acceptable carriers. For example, liposome 
formulations may be prepared by methods known to those of skill in the art tsee, 
e^fl., Kimm et aL (1 983) Bioch. Bioph. Acta 728:339-398; Assil et aL <1 987) 
Arch Ophthalmol. 105:400; and U.S. Patent No. 4,522,81 1]. The viral particles 
15 may be encapsulated into the aqueous phase of liposome systems. 

The active materials can also be mixed with other active materials, that 
do not impair the desired action, or with materials that supplement the desired 
action or have other action, including viscoelastic materials, such as hyaluronic 
acid, which is sold under the trademark HEALON, which is a solution of a high 
20 molecular weight (MW) of about 3 millions fraction of sodium hyaluronate 
[manufactured by Pharmacia, Inc; see, e.g.. U.S. Patent Nos. 5,292,362, 
5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803], VISCOAT 
[fluorine-containing (meth)acrylates, such as, 1H,1H,2H,2H-hepta- 
decafluorodecylmethacrylate; see, e^, U.S. Patent Nos. 5,278,126, 5,273,761 
25 and 5,214,080; commercially available from Alcon Surgical, Inc.], ORCOLON 
[see, e^, U.S. Patent No. 5,273,056; commercially available from Optical 
Radiation Corporation], methylcellulose, methyl hyaluronate, polyacrylamide and 
polymethacrylamide [see, e,a., U.S. Patent No. 5,273,751]. The viscoelastic 
materials are present generally in amounts ranging from about 0.5 to 5.0%, 
30 preferably 1 to 3% by weight of the conjugate material and serve to coat and 
protect the treated tissues. The compositions may also include a dye, such as 
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methylene blue or other inert dye, so that the composition can be seen when 
injected into the eye. Additional active agents may be included. 

The compositions can be enclosed in ampules, disposable syringes or 
multiple or single dose vials made of glass, plastic or other suitable material. 
5 Such enclosed compositions can be provided in kits. In particular, kits 

containing vials, ampules or other containers, preferably disposable vials with 
sufficient amount of the composition to deliver about 0.100 ml thereof, and 
disposable needles, preferably self sealing 25-30 gauge needles, are provided 
herein. 

10 Finally, the compounds may be packaged as articles of manufacture 

containing packaging material, typically a vial, an ophthalmologically acceptable 
composition containing the viral particles and a label that indicates the 
therapeutic use of the composition. 

Also provided are kits for practice of the methods herein. The kits 

15 contain one or more containers, such as sealed vials, with sufficient composition 
for single dosage administration, and one or more needles, such as self sealing 
25-33 gauge needles, preferably 33 gauge or smaller needles, precisely 
calibrated syringes or other precisely calibrated delivery device, suitable for 
intravitreal injection. 

20 Administration of the composition is preferably by intraocular injection, 

although other modes of administration may be effective, if the sufficient 
amount of the compound achieves contact with the vitreous cavity. Intraocular 
injection may be effected by intravitreal injection, aqueous humor injection or 
injection into the external layers of the eye, such as subconjunctival injection or 

25 subtenon injection, or by topical application to the cornea, if a penetrating 
formulation is used. 
Administration 

The compositions containing the compounds are administered 
intraocularly or by other means, such as topically in the form of penetrating 
30 eyedrops, whereby contact of the recombinant vectors with the aqueous humor 
is effected. Intraocular administration may be effected by intravitreal injection, 
aqueous humor injection, injection into the external layers of the eye, such as 
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subconjunctival injection or subtenon injection, preferably in free form, but, 
alternatively, in liposomes or other sustained drug delivery device. 
Administration is preferably by intravitreal injection, preferably through self 
sealing, 25-30 gauge needles or other suitably calibrated delivery device* 
5 Injection into the eye may be through the pars plana via the self-sealing needle. 

It is further understood that, for any particular subject, specific dosage 
regimens should be adjusted over time according to the individual need and the 
professional judgment of the person administering or supervising the 
administration of the recombinant viruses, and that the concentration ranges set 
10 forth herein are exemplary only and are not intended to limit the scope or 
practice of the claimed methods 
F. Diseases, Disorders and therapeutic products 
1 . Disease and disorders 

Retinitis pigmentosa 

1 5 Methods for specifically or selectively targeting recombinant adenovirus 

vectors for delivery of gene products, particularly therapeutic products are 
provided herein. These methods are particularly suitable for targeting cells that 
express receptors that are selectively recognized by Ad virus of subgroup D 
viruses, particularly Ad37. It is shown herein that these viruses selectively 

20 recognize receptors on cells, such as conjunctival cells and photoreceptors, that 
are not recognized by other adenoviruses. Hence, methods for targeting to 
these ceil types by providing vectors that are packaged in viral particles that 
contain a sufficient portion of a fiber protein from one of these Ad serotypes to 
bind to these receptors. These methods are useful for targeting to 

25 photoreceptors and for treating ocular disorders, including, but are not limited to, 
inherited and acquired retinal, neovascular degenerative diseases (see table 
below). 

It is estimated that 1 in 3,500 individuals in the United States suffer from 
one of the pigmented retinopathies. This group of retinal diseases, commonly 
30 called retinitis pigmentosa, is characterized by progressive loss of peripheral and 
night vision. Patients may be affected at almost any age and it is not 
uncommon to experience symptoms in early childhood in certain inherited forms. 




PCT/EP01/04863 



-40- 

It has been shown that there are a variety of mutations in genes expressed in 
the photoreceptors, including genes in the rhodopsin gene and pathway that 
appear to be responsible for these diseases. In addition to mutations in 
rhodopsin, changes in the retinal pigmented epithelial (RPE) cells, also undergo 
5 degenerative changes and can form clumps of pigment that give rise to the 
characteristic pigmentary changes seen in patients with RP. 

Angiogenesis and ocular diseases and disorders 
The vast majority of diseases that cause catastrophic loss of vision do so 
as a result of ocular neovascularization; age related macular degeneration 

10 (ARMD) affects 12-15 million American over the age of 65 and causes visual 
loss in 10-15% of them as a direct effect of choroidal (sub-retinal) 
neovascularization. The leading cause of visual loss for Americans under the 
age of 65 is diabetes; 1 6 million individuals in the United States are diabetic and 
40,000 per year suffer from ocular complications of the disease, which often are 

15 a result of retinal neovascularization. Laser photocoagulation has been effective 
in preventing severe visual loss in subgroups of high risk diabetic patients, but 
the overall 10 year incidence of retinopathy remains essentially unchanged, For 
patients with choroidal neovascularization due to ARMD or inflammatory eye 
disease, such as ocular histoplasmosis, photocoagulation, with few exceptions, 

20 is ineffective in preventing visual loss. While recently developed, non- 
destructive photodynamic therapies hold promise for temporarily reducing 
individual loss in patients with previously untreatable choroidal 
neovascularization, only 61.4% of patients treated every 3-4 months had 
improved or stabilized vision compared to 45.9% of the placebo-treated group. 

25 In the normal adult, angiogenesis is tightly regulated and limited to wound 

healing, pregnancy and uterine cycling. Angiogenesis is turned on by specific 
angiogenic molecules such as basic and acidic fibroblast growth factoMFGF), 
vascular endothelial growth factor (VEGF), angiogenin, transforming growth 
factor (TGF), tumor necrosis factor-cr (TNF-or) and platelet derived growth factor 

30 (PDGF). Angiogenesis can be suppressed by inhibitory molecules such as 

interferon-cr, thrombospondin-1, angiostatin and endostatin. It is the balance of 
these naturally occurring stimulators and inhibitors that controls the normally 
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quiescent capillary vasculature. When this balance is upset, as in certain disease 
states, capillary endothelial cells are induced to proliferate, migrate and 
ultimately differentiate. 

Angiogenesis plays a central role in a variety of diseases, including, but 
5 are not limited to, cancer and ocular neovascularization. Sustained growth and 
metastasis of a variety of tumors has also been shown to be dependent on the 
growth of new host blood vessels into the tumor in response to tumor derived 
angiogenic factors. Proliferation of new blood vessels in response to a variety of 
stimuli occurs as the dominant finding in the majority of eye diseases that blind, 

10 such as, but are not limited to, proliferative diabetic retinopathy (PDR), ARMD, 
rubeotic glaucoma, interstitial keratitis and retinopathy of prematurity. In these 
diseases, tissue damage can stimulate release of angiogenic factors resulting in 
capillary proliferation. VEGF plays a dominant role in iris neovascularization and 
neovascular retinopathies. While reports clearly show a correlation between 

1 5 intraocular VEGF levels and ischemic retinopathic ocular neovascularization, *FGF 
likely plays a role. Basic and acidic FGF are known to be present in the normal 
adult retina, even though detectable levels are not consistently correlated with 
neovascularization. This may be largely due to the fact that TCF binds very 
tightly to charged components of the extracellular matrix and may not be readily 

20 available in a freely diffusible form that would be detected by standard assays of 
intraocular fluids. 

A final common pathway in the angiogenic response involves integrin- 
mediated information exchange between a proliferating vascular endothelial cell 
and the extracellular matrix. This class of adhesion receptors, called integrins, 

25 are expressed as heterodimers having an a and fi subunit on all cells. One such 
integrin, flgf? 3 , is the most promiscuous member of this family and allows 
endothelial cells to interact with a wide variety of extracellular matrix 
components. Peptide and antibody antagonists of this integrin inhibit 
angiogenesis by selectively inducing apoptosis of the proliferating vascular 

30 endothelial cells. Two cytokine-dependent pathways of angiogenesis exist and 
may be defined by their dependency on distinct vascular cell integrins, aj? 3 and 
ffjfrs- Specifically, basic FGF- and VEGF-induced angiogenesis depend on integrin 
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aj9 3 and aj? 5f respectively, since antibody antagonists of each integrin 
selectively block one of these angiogenic pathways in the rabbit corneal and 
chick chorioallantoic membrane (CAM) models. Peptide antagonists that block 
all cr v integrins inhibit FGF- and. VEGF-stimulated angiogenesis. While normal. 
5 human ocular blood vessels do not display either integrin, aJ3 3 and aj9 5 integrins 
are selectively displayed on blood vessels in tissues from patients with active 
neovascular eye disease. While only aJ3 3 was consistently observed in tissue 
from patients with ARMD,cg? 3 and a^ 5 were present in tissues from patients 
with PDR. Systemically administered peptide antagonists of integrins blocked 

10 new blood vessel formation in a mouse model of retinal vasculogenesis. 

in addition to adhesion events described above, cell migration through the 
extracellular matrix also depends on proteolysis: Matrix metalloproteinases are a 
family of zinc-requiring matrix-degrading enzymes that include the collagenases, 
gelatinases and stromelysins, all of which have been implicated in invasive cell 

1 5 behavior. Invasive cell processes such as tumor metastasis and angiogenesis 
have been found to be associated with the expression of integrins and MMP-2, 
MMP-2 are all found throughout the eye where they may interact to maintain a 
quiescent vasculature until the balance is upset, resulting in pathological 
angiogenesis. A non-catalytic C-terminal hemopexin-like domain of MMP-2 
, 20 (PEX) can block cell surface collagenolytic activity and inhibit angiogenesis in the 
CAM model by preventing localization of MMP-2 to the surface of invasive 'cells 
through interaction with the integrin aJ3 3 . 

Hence, anti-angiogenic agents have a role in treating retinal degeneration 
to prevent the damaging effects of these trophic and growth factors. 

25 Angiogenic agents, also have a role in promoting desirable vascularization to 
retard retinal degeneration by enhancing blood flow to cells. 

Members of adenovirus subgroup D, Ad8, 19A, and 37, are infectious 
agents that cause particularly severe cases of epidemic keratoconjunctivitis 
(EKC) (Arnberg etal. (1998) Virology 227:239-244; Curtis et at. (1998) 

30 J.Med.Microbiol. 47:91-94; Ritterband et ah (1998) Rev. Med. Virol, ff: 187-20 1; 
and Takeuchi etal. (1999) J. Clin. Microbiol. 37:3392-3394). There is no 
effective treatment for this debilitating and contagious disease andCKC 
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continues to be a problem in ophthalmology clinics worldwide (Curtis et a/. 
(1998) J. Med. Microbiol. 47:91-94, Lukashok eta/. (1998) 

Curr.Clin.Top.lnfec.Dis. 75:286-304). Hence the vectors herein may be used for 
treating the disease. 

Table 3 

Candidate targets for ocular disease therapy 



— _ 

CANDIDATE TARGETS FOR OCULAR DISEASE THERAPY 


Disease 


Candidate target(s) 


Retinitis pigmentosa 


Rhodopsin gene, and genes that 
regulate expression thereof 
rcfe/peripherin 


Stargardt's disease 


rim protein (ARC protein) 






Choroideremia 


rab geranylgeranyl transferase 
CHM, TCD, CHML* 


Gyrate Atrophy 


ornithine aminotransferase 


Macular dystrophy 


rcfe/peripherin 



10 



15 



* see, w MSR6-yeast homologue of the choroideraemia gene/ Nature Genetics 
3:193-4(1993) 

TABLE 4 



Other Diseases 



20 



Exudative Choroidal Diseases 



ICSC, fluorescein angiogram 



ICSC with large serious detachment of RPE (retinal pigmented ep ithelium) 
ICSC with bullous retinal detachment 



Macular drusen, exudative, confluent 



25 



Drusen, sub-RPE choroidal neovascularization 



Drusen, notched serous detachment of RPE 
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Other Diseases 



Drusen, notched serous and hemorrhagic detachment of RPE 



Drusen, serous and hemorrhagic detachment of RPE and retina 



Drusen, organized RPE detachment causing bullous retinal detachment 



Drusen, geographic atrophy of RPE 



Drusen, exudative and cuticular, vitelliform macular detachment 



Drusen, cuticular, large vitelliform macular detachment 



North Carolina dystrophy with macular staphyloma 



North Carolina dystrophy with macular staphyloma 



Angioid streaks, pseudoxanthoma elasticum (PXE), CNVM 



10 



Angioid streaks, PXE, large notched retinal detachment 



Myopic degeneration, Foerster-Fuchs spot 



Presumed ocular histoplasmosis syndrome (POHS) 



Submacular bacterial abscess 



Toxocara canis, subretinal granuloma 



15 



Serpiginous (geographic) choroiditis 



Posterior scleritis 



Harada's disease 



Posterior sympathetic uveitis 



Benign reactive lymphoid hyperplasia of uveal tract 



20 



Choroidal ruptures and CNVM 



Cavernous hemangioma of choroid 



Choroidal osteoma 



Choroidal nevus, serous macular detachment 



Choroidal nevus with CNVM 



25 



Diffuse sclerochoroidal melanocyte nevus 



Choroidal melanoma with serous detachment of RPE 



Metastatic lung carcinoma to choroid 



Sub-RPE reticulum cell sarcoma 
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Other Diseases 



RPE tear, idiopathic choroidal neovascularization 



Heredodystrophic Disorders Affecting RPE & Retina 



Best's vitelliform macular dystrophy 



Best's vitelliform macular dystrophy with CNVM 



Best's vitelliform macular dystrophy, multiple lesions 



Adult-onset vitelliform foveomacular dystrophy 



Pattern dystrophy simulating fundus flavimaculatus 



Stargardt's disease (fundus flavimaculatus) 



Asteroid macular dystrophy 



10 



Sjogren-Larssen syndrome 



Oguchi's disease, light-adapted state 



Oguchi's disease, dark-adapted state 



Fundus albipunctatus 



Retinitis pigmentosa, cystoid macular edema 



15 



Crystalline tapetoretinal dystrophy 



Choroideremia 



Goldmann-Favre syndrome 



Sex-linked juvenile retinoschisis 



Perivenous retinitis pigmentosa 



20 



Retinal Vascular Disorders 



Retinal arteriovenous aneurysm 



Central retinal artery occlusion 



Cilioretinal artery obstruction 



Ischemic retinopathy in systemic lupus erythematosus 



25 



Ischemic retinopathy in scleroderma 



Hemorrhagic detachment of internal limiting membrane, hypertensive 
retinopathy 



Acquired retinal arterial macroaneurysm 
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Other Diseases 



Cystoid macular edema, aphakic 



Cystoid macular edema, nicotinic acid maculopathy 



Congenital retinal telangiectasis 



Acquired bilateral juxtafoveal telangiectasis 



Acquired bilateral juxtafoveal obliterative telangiectasis 



Diabetic optic neuropathy 



X-ray radiation exudative retinopathy 



Sickle cell SC disease, macular hemorrhage 



Retinal arterial aneurysms, arteritis, neuroretinitis 



10 



Branch retinal vein obstruction (BRVO) 



BRVO, exudative maculopathy 



BRVO, optic disc new vessels, photocoagulation 



Waldenstrom's macroglobulinemia 



Inflammatory Diseases of the Retina and Choroid 



15 



Luetic retinal vasculitis 



Focal Candida retinal abscess 



Toxoplasmosis, atrophic chorioretinal scar 



Toxoplasmosis retinitis and macular detachment 



Toxoplasmosis scar, CNVM, macular detachment 



20 



Diffuse unilateral subacute neuroretinitis, small worm 



Diffuse unilateral subacute neuroretinitis, large worm 



Cytomegalic inclusion disease, papillitis 



Acute posterior multifocal placoid pigment epitheliopathy 



Acute macular neuroretinitis 



25 



Sarcoid retinitis 



Sarcoid papillitis 



Behcet's disease 



Vitiliginous (bird-shot) chorioretinitis 
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Other Diseases 



Multifocal choroiditis and panveitis (pseudo-POHS) 



Retinal and Pigment Epithelial Hamartomas 



Congenital grouped albinotic RPE spots 



Congenital hyperplasia of RPE 



Combined RPE and retinal hamartoma, juxtapapillary 



Combined RPE and retinal hamartoma, peripheral 



Cystic astrocytoma, juxtapapillary 



Astrocytoma, macula 



Astrocytoma, juxtapapillary 



10 



Cavernous hemangioma of retina 



Juxtapapillary sessile retinal capillary hemangioma 



Juxtapapillary endophytic retinal capillary hemangioma 



Other Tumors of the Choroid 



Choroidal metastasis 



15 



Choroidal osteoma 



Choroidal hemangioma 



Miscellaneous uveal tumors 



Intraocular Lymphoid Tumors 



The leukemias and lymphomas 



20 



Tumors of the Vitreous 



Non-Hodgkins ("reticulum cell") lymphoma 



Tumor involvement of the vitreous cavity 



Macular Disease 



Age-related macular degeneration - atrophic form 



25 



Exudative age-related macular degeneration 



Choroidal neovascular membrane in degenerative myopia 
Central serous retinopathy 
Macular hole 
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Other Diseases 



Macular dystrophies 



Retinal Vascular Disease 



Etiologic mechanisms in diabetic retinopathy 



Background diabetic retinopathy 



Proliferative diabetic retinopathy 



Retinal arterial obstructive disease 



Central retinal vein occlusion 



Retinal branch vein occlusion 



Pregnancy and retinal disease 



10 



Pregnancy-induced hypertension 



Hypertension 



The rheumatic disease 



Parafoveal telangiectasis 



Coats disease 



15 



Disseminated intravascular systemic coagulopathy and related vasculopathies 



Hemoglobinopathies 



Retinopathy of prematurity 



Acquired retinal macroaneurysms 



20 



Eales disease 



Radiation retinopathy 



The ocular ischemic syndrome 



Inflammatory Disease 



Ocular toxoplasmosis 



Ocular toxocariasis 



25 



Ocular cysticercosis 



Cytomegalovirus infections of the retina 



Retinal and ophthalmologic manifestations of AIDS 



Acute retinal necrosis syndrome 
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Other Diseases 



Endogenous fungal infections of the retina and choroid 



Pars planitis 



Syphilis and tuberculosis 



Diffuse unilateral subacute neuroretinitis 



Scleritis 



Birdshot retinochoroidopathy 



Punctate inner choroidopathy 



Sarcoidosis 



10 



Acute multifocal placoid pigment epitheliopathy 



Geographic helicoid peripapillary choroidopathy «3HPC): serpiginous 
choroiditis 



Sympathetic ophthalmia 



Vogt-Koyanigi-Harada syndrome (uveomeningitic syndrome) 



15 



Ciliochoroidal (uveal) effusion 



Reproduced from: Stereoscopic Atlas of Ocular Diseases Diagnosis and Treat- 
ment, 2nd Edition, J. Donald 0. Gass, Vo). 1 & 2, C.V. Mosley Co. (1987); 
and Retina Vol. II, Editor, Stephen J. Ryan, Medical Retina, C.V. Moslay Co. 
(1989). 

20 2. Therapeutic products 

Therapeutic products include but are not limited to, wild-type genes that 
are defective in ocular disorders, such as rhodopsin, or fragments thereof 
sufficient to correct the genetic defect, trophic factors, including growth factors, 
inhibitors and agonists of trophic factors, anti-apoptosis factors and other 

25 products described herein or known to those of skill in the art to be useful for 
treatment of disorders of the eye or that can be treated by a product expressed 
by a photoreceptor. 



OCULAR GENE THERAPY STRATEGIES J 


GENERAL DISEASE 


EXAMPLES 


STRATEGY J 
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OCULAR GENE THERAPY STRATEGIES 


Hereditarv retinal and 
macular degeneration 


• Retinitis Diamentosa 

• Stargardt's disease 

• Other macular 
dystrophies 


Growth factors (e a 
GDNF) 

anti-apoptotic. factors ] 
[e.g., bcl2 gene) 
Stargardt Disease Gene 
(ABCR) r 


Neovascular 


• Diabetes 

• Choroidal 
neovascularization 


Anti-angiogenesis 
factors 


Anti-tumor 


Retinoblastoma 


Antiproliferant 


Glaucoma 


Nerve fiber layer 
atrophy 


Neuroprotective agent 



ee Allikmets et a/. (1997) Science 277:1805-1807. 



For example, for treatment of retinitis pigmentosa the adenovirus vector 
can deliver a wild-type rhodopsin gene or a growth factor or trophic factor, such 
as ciliary neurotrophic factor CNTF; for treatment of Stargardt's disease, the 
10 vector can deliver a wild type ABCR (also called STGD1) or a growth factor or 
anti-angiogenic agent; for diabetic retinopathies, retinal vascularization the 
vector can deliver growth factors, such as a TGF (TGFjP), to prevent 
degeneration. 

15 The following examples are included for illustrative purposes only and are 

not intended to limit the scope of the invention. 

EXAMPLE 1 

Preparation of Adenovirus Packaging Cell Lines 

Cell lines that are commonly used for growing adenovirus are useful as 

20 host cells for the preparation of adenovirus packaging cell lines. Preferred cells 
include 293 cells, an adenovirus-transformed human embryonic kidney cell line 
obtained from the ATCC, having Accession Number £RL 1573; HeLa, a human 
epithelial carcinoma cell line (ATCC Accession Number CCL-2); A549, a human 
lung carcinoma cell line (ATCC Accession Number OCL 1889); and other 

25 epithelial-derived cell lines. As a result of the adenovirus transformation, the 
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293 cells contain the E1 early region regulatory gene. All cells were maintained 

in complete DMEM + 10% fetal calf serum unless otherwise noted. 

These cell lines allow the production and propagation of adenovirus-based 

gene delivery vectors that have deletions in preselected gene regions and that 

5 are obtained by cellular complementation of adenoviral genes. To provide the 

desired complementation of such deleted adenoviral genomes in order to 

generate a viral vector, plasmid vectors that contain preselected functional units 

have been designed. Such units include but are not limited to €1 early region, 

E4 and the viral fiber gene. The preparation of plasmids providing such ^ 

10 complementation, thereby being "complementary plasmids or constructs," that 

are stably inserted into host cell chromosomes are described below. 

A. Preparation of an E4-Expressing Plasmid for Complementation of 
E4-Gene-Deleted Adenoviruses 

The viral E4 regulatory region contains a single transcription unit that is 

1 5 alternately spliced to produce several different mRNA products. The 

E4-expressing plasmid prepared as described herein and used to transfect the 
293 cell line contains the entire E4 transcription unit. A DNA fragment 
extending from 1 75 nucleotides upstream of the €4 transcription start site 
including the natural E4 promoter to 153 nucleotides downstream of the €4 

20 polyadenylation signal including the natural 1E4 terminator signal, corresponding 

to nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as £ 
Ad5) genome as described in Chroboczek et al. [Virol., 186:280-285 (1992), 
GenBank Accession Number M73260), was amplified from Ad5 genomic DNA, 
obtained from the ATCC, via the polymerase chain reaction (PCR). Sequences of 

25 the primers used were 5XGGTACACAGAATTCAGGAX3ACACAACTCC3' 
(forward or 5' primer referred to as €4L) (SEQ ID NO: 1) and 
5 f GCCTGGATCCGGGAAGTTACGTAACGTGGGAAAAC3 # (SEQ ID NO: 2) 
(backward or 3' primer referred to as E4R). To facilitate cloning of the PCR 
fragment, these oligonucleotides were designed to create new sites for the 

30 restriction enzymes EcoRI and BamHI, respectively, as indicated with underlined 
nucleotides. DNA was amplified via PCR using 30xycles of 92 C for 1 minute, 
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50 C for 1 minute, and 72 C for 3 minutes resulting in amplified full-length €4 
gene products. 

The amplified DNA E4 products were then digested with €coRI and 
BamHI for cloning into the compatible sites of pBluescript/SK + by standard 
5 techniques to create the plasmid pBS/£4. A 2603 base pair (bp)xassette 
including the herpes simplex virus thymidine kinase promoter, the hygromycin 
resistance gene, and the thymidine kinase polyadenylation signal was excised 
from the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with Fspl 
followed by addition of BamHI linkers (5TGCGGATCCGCG3') (SEQ ID NO: 3) 
10 for subsequent digestion with BamHI to isolate the hygromycin-containing 
fragment. 

The isolated BamHI-modified fragment was then cloned into the BamHI 
site of pBS/E4 containing the E4 region to create the plasmid pE4/Hygro 
containing 8710 bp. The pE4/Hygro plasmid has been deposited with the ATCC 

15 under accession number 97739. The complete nucleotide sequence of 

pE4/Hygro is set forth in SEQ ID NO: 4. Position number 1 of the linearized 
vector corresponds to approximately the middle portion of the pBS/SK + 
backbone. The 5' and 3' ends of the £4 gene are located at respective 
nucleotide positions 3820 and 707 of SEQ ID NO: 4 while the 5' and 3' ends of 

20 the hygromycin insert are located at respective nucleotide positions 3830 and 

6470. In the clone that was selected for use, the E4 and hygromycin resistance 

genes were divergently transcribed. 

B. Preparation of a Fiber-Expressing Plasmid for Complementation of 
Fiber-Gene-Deleted Adenoviruses 

25 To prepare a fiber-encoding construct, primers were designed to amplify 

the fiber coding region from Ad5 genomic DNA with the addition of unique 
BamHI and Notl sites at the 5' and 3' ends of the fragment, respectively. The 
Ad5 nucleotide sequence is available with the GenBank Accession Number 
M18369. The 5' and 3' primers had the respective nucleotide sequences of 

30 5 r ATG GGATCCA AGATGAAGCGCGCAAGACCG3' <SEQ ID NO: 5) and 

5'CATAACGCGGCC^TTCTTTATTCTTGGGC3' (SEQ ID NO: 6), where the 
inserted BamHI and Notl sites are indicated by underlining. The 5' primer also 
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contained a nucleotide substitution 3 nucleotides 5' of the second ATG codon iC 
to A) that is the initiation site. The nucleotide substitution was included so as to 
improve the consensus for initiation of fiber protein translation. 

The amplified DNA fragment was inserted into the BamHI and WoU sites 
5 of pcDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having 
7148 bp. The parent plasmid contained the CMV promoter, the bovine growth 
hormone (BHG) terminator and the gene for conferring neomycin resistance. The 
viral sequence included in this construct corresponds to nucleotides 
31040-32791 of the Ad5 genome. 
10 The complete nucleotide sequence of pCDNA3/Fiber is listed in SEQ ID 

NO: 7 where the nucleotide position 1 corresponds to approximately the middle 
of the pcDNA3 vector sequence. The 5' and 3' ends of the fiber gene are 
located at respective nucleotide positions 916 with ATG and 2661 with TAA. 

To enhance expression of fiber protein by the constitutive CMV promoter 
15 provided by the pcDNA vector, a Bglll fragment containing the tripartite leader 
(TPL) of adenovirus type 5 was excised from pRD1 12a (Sheay et al., 
BioTechniques, 15:856-862 (1993) and inserted into the BamHI site of 
pCDNA3/Fiber to create the plasmid pCLF having 7469 bp. The adenovirus 
tripartite leader sequence, present at the 5' end of all major late adenoviral 
20 mRNAs as described by Logan et a!., Proc.JJati. Acad. ScL, USA, 81 :365S- 

3659 (1984) and Berkner, BioTechniques, 5:616-629 (1988), also referred to as 
a "partial TPL" since it contains a partial exon 1, shows correspondence with the 
Ad5 leader sequence having three spatially separated exons corresponding to 
nucleotide positions 6081-6089 (the 3' end of the first leader segment), 71 1 1- 
25 7182 (the entire second leader segment), and 9644-9845 (the third leader 

segment and sequence downstream of that segment). The corresponding cDNA 
sequence of the partial tripartite leader sequence present in pCLF is included in 
SEQ ID NO: 8 bordered by BamHI/Bglll 5 # and 3' sites at respective nucleotide 
positions 907-912 to 1228-1233. The nucleotide sequence of an isolated partial 
30 TPL is also listed separately as SEQ ID No. 22 with the noted 5' and 3' 

restriction sites and with the following nucleotide regions identified: 1-6 nt Bglll 
site; 1-18 nt polylinker; 19-27 nt last 9 nt of the first leader segment (exon 1); 
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28-99 nt second leader segment (exon 2); 100-187 nt third leader segment 
(exon 3); 188-301 nt contains the nt sequence immediately following the third 
leader in the genome with an unknown function; and 322-327 nt Bglll site. 
The pCLF plasmid has been deposited with the ATCC as described in 
5 Example 4. The complete nucleotide sequence of pCLF is listed in S€Q ID NO: 8 
where the nucleotide position 1 corresponds to approximately the middle of the 
pcDNA3 parent vector sequence. The 5' and 3' ends of the Ad5 fiber gene are 
located at respective nucleotide positions 1237-1239 with ATG and 2980-2982 
with TAA. 

10 C. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids 

Encoding Functional E4 and Fiber Proteins 

The 293 cell line was selected for preparing the first adenovirus 

packaging line as it already contains the E1 gene as prepared by Graham et at., 

J. Gen. Virol., 35:59-74 (1977) and as further characterized by Spector, Virol., 

15 730:533-538 (1983). Before electroporation, 293 cells were grown in RPMI 
medium + 10% fetal calf serum. Four x 106 cells were electroporated with 20 
//g each of pE4/Hygro DNA and pCLF DNA using a BioRad GenePulser and 
settings of 300 V, 25 //F. DNA for electroporation was prepared using the 
Qiagen system according to the manufacturer's instructions (Bio-Rad, Richmond, 

20 CA). 

Following electroporation, cells were split into fresh complete DMEM + 
10% fetal calf serum containing 200//g/ml Hygromycin B (Sigma, i>t. Louis, 
MO). 

From expanded colonies, genomic DNA was isolated using the 
25 "MICROTURBOGEN" system (Invitrogen) according to manufacturer's 

instructions. The presence of integrated E4 DNA was assessed by PCR using 
the primer pair E4R and ORF6L (5'TGCTTAAGCGGCCGCGAA*GGAGA AGTCC3') 
(SEQ ID NO: 9), the latter of which is a 5' forward primer near adenovirus 5 
open reading frame 6. 
30 One clone, designated 211, was selected exhibiting altered growth 

properties relative to that seen in parent cell line 293. The 21 1 clone-contained 
the product,indicating the presence of inserted DNA corresponding 4o most, if 
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not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 21 1 cell 
line has been deposited with the ATCC as described in Example 4. This line was 
further evaluated by amplification using the primer pair E4L/E4R described 
above r and a product corresponding to the full-length £4 insert was detected. 
5 Genomic Southern blotting was performed on DNA restricted with EcoRI and 
Bam HI. The E4 fragment was then detected at approximately one copy/genome 
compared to standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4 
for use as a labeled probe with the Genius system according to manufacturer's 
instructions (Boehringer Mannheim, Indianapolis, IN). In DNA from the 21 1 cell 
10 line, the labeled internal fragment pE4/Hygro hybridized with the isolated €4 

sequences. In addition, the probe hybridized to a larger fragment which may be 
the result of a second insertion event. 

Although the 21 1 cell line was not selected by neomycin resistance, thus 
indicating the absence of fiber gene, to confirm the lack of fiber gene, the 21 1 
1 5 cell line was analyzed for expression of fiber protein by indirect 

immunofluorescence with an anti-fiber polyclonal antibody and a FITC-labeled 
anti-rabbit IgG (KPL) as secondary. No immunoreactivity was detected. 
Therefore, to generate 21 1 clones containing recombinant fiber genes, the 21 1 
clone was expanded by growing in RPMI medium and subjected to additional 
20 electroporation with the fiber-encoding pCLF plasmid as described above. 

Following electroporation, cells were plated inDMEM + 10% fetal calf 
serum and colonies were selected with 200 //g/ml G41 8 {Gibco, Oaithersburg, 
MD). Positive cell lines remained hygromycin resistant. These candidate 
sublines of 21 1 were then screened for fiber protein expression by indirect 
25 immunofluorescence as described above. The three sublines screened, 21 1 A, 
21 1B and 21 1R, along with a number of other sublines, all exhibited nuclear 
staining qualitatively comparable to the positive control of 293 cells infected 
with AdRSV^gal (1 pfu/cell) and stained 24 hours post-infection. 

Lines positive for nuclear staining in this assay were then subjected to 
30 Western blot analysis under denaturing conditions using the same antibody. 

Several lines in which the antibody detected a protein of the predicted molecular 
weight (62 kd for the Ad5 fiber protein) were selected for further study including 
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21 1 A, 21 1 B and 21 1 R. The 21 1 A cell line has been deposited with ATCC as 

described in Example 4. 

Immunoprecipitation analysis using soluble nuclear extracts from these 

three cell lines and a seminative electrophoresis system demonstrated that the 

5 fiber protein expressed is in the functional trimeric form characteristic of the 

native fiber protein. The predicted molecular weight of a trimerized Tiber is 1 86 

kd. Under denaturing conditions, the trimeric form was destroyed resulting in 

detectable fiber monomers. Those clones containing endogenous €1, newly 

expressed recombinant E4 and fiber proteins were selected for use in 

10 complementing adenovirus gene delivery vectors having the corresponding 

adenoviral genes deleted as described in Example 2. 

D. Preparation of an E1 -Expressing Plasmid for Complementation of 
El -Gene-Deleted Adenoviruses 

In order to prepare adenoviral packaging cell lines other than those based 

15 on the E1-gene containing 293 cell line as described in Example 1C above, 

plasmid vectors containing E1 alone or in various combinations withE4 and fiber 
genes are constructed as described below. 

The region of the adenovirus genome containing the E1a and E1b gene is 
amplified from viral genomic DNA by PCR as previously described. The primers 

20 used are E1L, the 5' or forward primer, and E1R, the 3' or backward primer, 
having the respective nucleotide sequences 5'CCG AGCTAGC 
X3ACTGAAAATGAG3' (SEQ ID NO: 10) and 5'CCTCTCGAG AGACAGC 
AAGACAC3' (SEQ ID NO: 11). The E1L and E1R primers include the respective 
restriction sites Nhel and Xhol as indicated by the underlines. The sites are used 

25 to clone the amplified E1 gene fragment into the Nhel/Xhol sites in pMAM 
commercially available from Clontech (Palo Alto, CA) to form the plasmid 
pDEX/EI having 11152 bp. 

The complete nucleotide sequence of pDEX/E1 is listed in SEQ ID NO: 12 
where the nucleotide position 1 corresponds to approximately 1454 nucleotides 

30 from the 3' end of the pMAM backbone vector sequence. The pD£X/E1 plasmid 
includes nucleotides 552 to 4090 of the adenovirus genome positioned 
downstream (beginning at nucleotide position 1460 and ending at 4998 in the 
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pDEX/E1 plasmid) of the glucocorticoid-inducible mouse mammary tumor virus 

(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene 

that allows stable transfectants to be isolated using hypoxanthine/amino- 

pterin/thymidine (HAT) selection- The pMAM backbone occupies nucleotide. 

5 positions 1-1454 and 5005-1 1 152 of SEQ ID NO: 12. 

E. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids 
Encoding Functional E1, and Fiber Proteins 

To create separate adenovirus packaging cell lines equivalent to that of 

the 21 1 sublines, 21 1A, 21 1 B and 21 1R, as described in Example 1C, 

10 alternative cell lines lacking adenoviral genomes are selected for transfection 
with the plasmid constructs as described below. Acceptable host cells include 
A549, Hela, Vero and the like cell lines as described in Example 1 . The selected 
cell line is transfected with the separate plasmids, pDEX/E1and pCLF, 
respectively for expressing E1, and fiber complementary proteins. Following 

15 transfection procedures as previously described, clones containing stable 

insertions of the two plasmids are isolated by selection with neomycin and HAT. 
Integration of full-length copy of the E1 gene is assessed by PCR amplification 
from genomic DNA using the primer set E1L/E1R , as described above. 
Functional insertion of the fiber gene is assayed by staining with the anti-fiber 

20 antibody as previously described. 

The resultant stably integrated cell line is then used as a packaging cell 

system to complement adenoviral gene delivery vectors having the 

corresponding adenoviral gene deletions as described in Example 2. 

F. Preparation of a Plasmid Containing Two or More Adenoviral Genes for 
25 Complementing Gene-Deleted Adenoviruses 

The methods described in the preceding Examples rely on the use of two 

plasmids, pE4/Hygro and pCLF, or, pCLF and pDEX/E1 for generating adenoviral 

cell packaging systems. In alternative embodiments, complementing plasmids 

containing two or more adenoviral genes for expressing of encoded proteins in 

30 various combinations are also prepared as described below. The resultant 

plasmids are then used in various cell systems with delivery plasmids having the 

corresponding adenoviral gene deletions. The selection of packaging cell, 

content of the delivery plasmids and content of the complementing plasmids for 
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use in generating recombinant adenovirus viral vectors thus depends on whether 
other adenoviral genes are deleted along with the adenoviral fiber gene, and, if 
so, which ones. 

1 . Preparation of a Complementing Plasmid Containing Fiber 
5 and El Adenoviral Genes 

A DNA fragment containing sequences for the CMV promoter, adenovirus 

tripartite leader, fiber gene and bovine growth hormone terminator is amplified 

from pCLF prepared in Example 1 B using the forward primer 

5'GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), that anneals to the 

10 nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the backward 
primer 5'CCGCCTCAGAAGCCATAGAGCC3' (SEQ ID NO: 14) that anneals to 
nucleotides 1278-1257 of the pCDNA3 vector backbone. The fragment is 
amplified as previously described and then cloned into the pDEX/E1 plasmid, 
prepared in Example 1D. For cloning in the DNA fragment, the pD£X/£1 vector 

15 is first digested with Ndel, that cuts at a unique site in the pMAM vector 
backbone in pDEX/E1 , then the ends are repaired by treatment with 
bacteriophage T4 polymerase and dNTPs. 

The resulting plasmid containing £1 and fiber genes, designated 
pE1 /Fiber, provides dexamethasone-inducible E1 function as described for 

20 DEX/E1 and expression of Ad5 fiber protein as described above. 

The complete nucleotide sequence of pE1 /Fiber is listed in SEQ ID NO: 15 
where the nucleotide position 1 corresponds to approximately 1459 nucleotides 
from the 3' end of the parent vector pMAM sequence. The 5' and 3' ends of 
the Ad5 E1 gene are located at respective nucleotide positions 1460 and 4998 

25 followed by pMAM backbone and then separated from the Ad5 fiber from pCLF 
by the filled-in blunt ended Ndel site. The 5' and 3' ends of the pCLF fiber gene 
fragment are located at respective nucleotide positions 10922-14223 containing 
elements as previously described for pCLF. 

The resultant pE1 /Fiber plasmid is then used to complement one or more 

30 delivery plasmids expressing E1 and fiber. 

The pE1 /Fiber construct is then used to transfect a selected host ^cell as 
described in Example 1E to generate stable chromosomal insertions preformed as 
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previously described followed by selection on HAT medium. The stable cells are 
then used as packaging cells as described in Example 2. 

2. Preparation of a Complementing Plasmid Containing E4 and 
Fiber Adenoviral Genes 

5 Plasmid pCLF prepared as described in -Example 1 B is partially digested 

with Bglll to cut only at the site in the pCDNA3 backbone. The pE4/Hygro 

plasmid prepared in Example 1 A is digested with BamHI to produce a fragment 

containing E4. The E4 fragment is then inserted into the BamHI site of pCLF to 

form plasmid pE4/Fiber. The resultant plasmid provides expression of the fiber 

10 gene as described for pCLF and E4 function as described for pE4/Hygro. 

A schematic plasmid map of pE4/Fiber, having 10610 bp. The complete 
nucleotide sequence of pE4/Fiber is listed in SEQ ID NO: 1 6 where the 
nucleotide position 1 corresponds to approximately 14 bp from the 3' end of the 
parent vector pCDNA3 backbone sequence. The 5' and 3' ends of the Ad5 €4 

15 gene are located at respective nucleotide positions 21 and 3149 followed by 
fused Bglll/BamHI sites and pCDNA3 backbone including the CMV promoter 
again followed by Bglll/BamHI sites. The adenovirus leader sequence begins at 
nucleotide position 4051 and extends to 4366 followed by fused BamHI/Bglll 
sites and the 5' and 3' ends of the fiber gene located at respective nucleotide 

20 positions 4372 and 6124. 

Stable chromosonal insertions of pE4/Fiber in host cells are obtained as 
described above. 

EXAMPLE 2 

Preparation of Adenoviral Gene Delivery Vectors Using Adenoviral Packaging Cell 
25 Lines 

Adenoviral delivery vectors are prepared to separately lack the 
combinations of E1 /fiber and E4/fiber. Such vectors are more replication- 
defective than those previously in use due to the absence of multiple viral genes. 
A preferred adenoviral delivery vector is replication competent but only via a 
30 non-fiber means is one that only lacks the fiber gene but contains the remaining 
functional adenoviral regulatory and structural genes. Furthermore, these 
adenovirus delivery vectors have a higher capacity for insertion of foreign DNA. 
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A. Preparation of Adenoviral Gene Delivery Vectors Having Specific "Gene 
Deletions and Methods of Use 

To construct the E1/ fiber deleted viral vector containing the LacZ 
reporter gene construct, two new plasmids were constructed. The plasmid pA 
5 E1B/?gal was constructed as follows. A DNA fragment containing the*SV40 

regulatory sequences and E. coli /?-galactosidase gene was isolated from pSV/?gal 
(Promega) by digesting with Vspl, filling the overhanging ends by treatment with 
Klenow fragment of DNA polymerase I in the presence of dNTP's and digesting 
with Bam H1 . The resulting fragment was cloned into the CcoRV and BamHI 

10 sites in the polylinker of pA E1sp1B (Microbix Biosystems, Hamilton, Ontario) to 
form pA E1B £gal that therefore contained the left end of the adenovirus genome 
with the Ela region replaced by the LacZ cassette (nucleotides 6690 to 4151) of 
pSV£ gal. Plasmid DNA may be prepared by the alkaline lysis method as 
described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1978) or by the 

15 Quiagen method according to the manufacturer's instruction, from transformed 
cells used to expand the plasmid DNA was then purified by CsCI-ethidium 
bromide density gradient centrifugation. Alternatively, plasmid ONAs may be 
purified from E. coli by standard methods known in the art (e.g. see Sambrook et 
al.) 

20 The second plasmid (pDV44), prepared as described herein, is derived 

from pBHGIO, a vector prepared as described by Bett-ef al., Proc. Natl. Acad. 
Sci. r USA, 91:8802-8806 (1994) (see, also International PCT application No, 
WO 95/00655) using methods well known to one of skill in the art. This vector 
is also commercially available from Microbix and and contains an Ad5 genome 

25 with the packaging signals at the left end deleted and the €3 region (nucleotides 
28133:30818) replaced by a linker with a unique site for the restriction enzyme 
Pad. An 11.9 kb BamHI fragment, which contains the right end of the 
adenovirus genome, is isolated from pBHGTO and cloned into the BamHI site of 
pBS/SK( + ) to create plasmid p1 1 .3 having approximately 14,658 bp. The p1 1 .3 

30 plasmid was then digested with Pad and Sail to remove the fiber, €4, and 
inverted terminal repeat (ITR) sequences. 




PCT/EP01/04863 



-61- 

This fragment was replaced with a 3,4 kb fragment containing the ITR 
segments and the E4 gene which was generated by PCR amplification from 
pBHGIO using the following oligonucleotide sequences: 
5 # TGTACACCG GATCCGGCGCACACC3' SEQ ID NO: 17; and 
5 5'CACAACGAGCTC AATTAATTAATTGCCACATCCTC3' SEQ ID NO: 18. 

These primers incorporated sites for Pad and BamHl. Cloning this fragment into 
the Pad and blunt ended Sail sites of the p1 1.3 backbone resulted in a 
substitution of the fused ITRs, E4 region and fiber gene present in pBHGIO, by 
the ITRs and E4 region alone. The resulting p1 1 .3 plasmid containing the ITR 
10 and E4 regions, designated plasmid pDV43a, was then digested with BamHl. 
This BamHl fragment was then used to replace a BamHl fragment in pBHGIO 
thereby creating pDV44 in a pBHGIO backbone. 

In an alternative approach to preparing pDV44 with an additional 
subcloning step to facilitate the incorporation of restriction cloning sites, the 
15 following cloning procedure was performed. pDV44 as above was constructed 
by removing the fiber gene and some of the residual E3 -sequences from pBHGIO 
(Microbix Biosystems). As above, to simplify manipulations, the 1 1.9 kb BamHl 
fragment including the rightmost part of the Ad5 genome was removed from 
pBHGIO and inserted into pBS/SK. The resulting plasmid was termed p1 1.3. 
20 The 3.4 kb DNA fragment corresponding to the E4 region and both ITRs of 
adenovirus type 5 was amplified as described above from pBHGIO using the 
oligonucleotides listed above and subcloned into the vector pCR2.1 (Invitrogen) 
to create pDV42. This step is the additional cloning step to facilitate the 
incorporation of a Sail restriction site. pDV42 was then digested with Pad, 
25 which cuts at a unique site {bold type) in one of the PCR primers, and with^Sall, 
which cuts at a unique site in the pCR2.1 polylinker. This fragment was used to 
replace the corresponding Pacl/Xhol fragment of p1 1 .3 (the pBS polylinker 
adjacent to the Ad DNA fragment contains a unique Xhol site), creating pDV43. 

30 A plasmid designated pDV44 was constructed by replacing the 1 1 .9 kb 

BamHl fragment of pBHGIO by the analogous BamHl fragment of pDV43. As 
generated in the first procedure, pDV44 therefore differs from pBHGIO by the 
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deletion of Ad5 nucleotides 30819:32743 (residual €3 sequences and all but the 
3'-most 41 nucleotides of the fiber open reading frame). 

Thus, to summarize, the cloning procedures described above result in the 
production of a f iber-deleted Ad5 genomic plasmid- (pDV44) that was 
5 constructed by removing the fiber gene and some of the residual €3 sequences 
from pBHGIO. pDV44 contains a wild-type E4 region, but only the last 41 
nucleotides of the fiber ORF (this sequence was retained to avoid affecting 
expression of the adjacent E4 transcription unit). Plasmids pBHGIO and pDV44 
contain unpackageable Ad5 genomes, and must be rescued by cotransfection 

10 and subsequent homologous recombination with DN A carrying functional 

packaging signals. In order to generate vectors marked with a reporter gene, 
either pDV44 or pBHGIO was cptransfected with pAEIBRgal, which contains 
the left end of the Ad5 genome with an SV40-driven B-galactosidase reporter 
gene inserted in place of the E1 region. 

15 In general, and as described below, the method for virus production by 

recombination of plasmids followed by complementation in cell culture involves 
the isolation of recombinant viruses by cotransfection of any one of the 
adenovirus packaging cell systems prepared in Example 1, namely 211 A, 21 1B, 
21 1R, A549, Vero cells, and the like, with plasmids carrying sequences 

20 corresponding to viral gene delivery vectors. 

A selected cell line is plated in dishes and cotransfected with pOV44 and 
pAE1B/? gal using the calcium phosphate method as described by Bett«ef a/., 
Proc. Natl. Acad. ScL, USA, 37:8802-8806 (1994). Recombination between the 
overlapping adenovirus sequences in the two plasmids leads to the creation of a 

25 full-length viral chromosome where pDV44 and pAE1B/? gal recombine to form a 
recombinant adenovirus vector having multiple deletions. The deletion of €1 and 
of the fiber gene from the viral chromosome is compensated for by the 
sequences integrated into the packaging cell genome, and infectious virus 
particles are produced. The plaques thus generated are isolated and stocks of 

30 the recombinant virus are produced by standard methods. 

Because of the fiber deletion, a pDV44-derived virus is 
replication-defective, cells in which it is grown must complement this defect. 
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The 21 1 B cell line (a derivative of 293 cells which expresses the wild-type (wt) 
AD5 fiber and is equivalent to 21 1 A on deposit with ATCC as described in 
Example 4) was used for rescue and propagation of the virus described here. 
pDY44 and pAEIBgal were, cotransf ected into 21 1B. cells, and. the monolayers 
5 were observed for evidence of cytopathic effect (CPE). Briefly, for virus 

construction, cells were transf ected with the indicated plasmids using the ? Gibco 
Calcium Phosphate Transfection system according to the manufacturer's 
instructions and observed daily for evidence of CPE. 

One of a total of 58 transfected dishes showed evidence of spreading cell 
10 death at day 15. A crude freeze-thaw lysate was prepared from these cells and 
the resulting virus (termed Ad5.Bgal.AF) was plaque purified twice and then 
expanded. To prepare purified viral preparations, cells were infected with the 
indicated Ad and observed for completion of CPE. Briefly, at day zero, 21 1B 
cells were plated in DMEM plus 10% fetal calf serum at approximately 1 X TO 7 
15 cells/150 cm 2 flask or equivalent density. At day one, the medium was replaced 
with one half the original volume of fresh DMEM containing the indicated Ad, in 
this case Ad5.Bgal.AF, at approximately 100 particles/cell. At day two, an equal 
volume of medium was added to each flask and the cells were observed for CPE. 
Two to five days after infection, cells were collected and virus isolated by lysis 
20 via four rapid freeze-thaw cycles. Virus was then purified by centrifugation on 
preformed 1 5-40% CsCI gradients (111 ,000 x g for three hours at 4°C). The 
bands were harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% 
NaCI, and 10% glycerol), aliquoted and stored at - 70°C. Purified AdS.BgaLAf 1 
virus particles containing human adenovirus AdS.Bgal.AFgenome (described 
25 further below) have been deposited with the ATCC on January 15, 1999 as 
further described in Example 4. 

For viral titering, as necessary in the below Examples, Ad preparations 
were titered by plaque assay on 21 1 B cells. Cells were plated on 
polylysine-coated 6 well plates at 1.5 x 10 6 cells/well. Duplicate dilutions of 
30 virus stock were added to the plates in 1 ml/well of -complete DMEM. After a 
five hour incubation at 37 °C, virus was removed and the wells overlaid with 2 
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ml of 0.6% low-melting agarose in Medium 199 CGibco). An additional 1 ml of 
overlay was added at five day intervals. 

As a control, the first-generation virus Ad5.fc gal.wt, which is identical to 
Ad5.Bgal.AF except for the fiber deletion, was constructed by cotransfection of 
5 pBHGIO.and pAEIBKgal. In contrast to the low efficiency of recovery of the 
fiberless genome (1/58 dishes), all of 9 dishes cotransfected with pAE1B/?gal 
and pBHGIO produced virus. 

In another embodiment, a delivery plasmid is prepared that does not 
require the above-described recombination events to prepare a viral vector 

10 having a fiber gene deletion. In one embodiment, a single delivery plasmid 
containing all the adenoviral genome necessary for packaging but lacking the 
fiber gene is prepared from plasmid pFG140 containing full-length Ad5 that is 
commercially available from Microbix. The resultant delivery plasmid referred to 
as pFG140-f is then used with pCLF stably integrated cells as described above *o 

1 5 prepare a viral vector lacking fiber. For genetic therapy, the fiber gene can be 
replaced with a therapeutic gene of interest for preparing a therapeutic delivery 
adenoviral vector. Methods for producing a fiberless vector with a complete TPL 
are described in Example 3. 

Vectors for the delivery of any desired gene and preferably a therapeutic 

20 gene are prepared by cloning the gene of interest into the multiple cloning sites 
in the polylinker of commercially available pAEIsplB (Microbix Biosystems), in 
an analogous manner as performed for preparing pE1B/? gal as described above. 
The same cotransfection and recombination procedure is then followed as 
described herein to obtain viral gene delivery vectors as further discussed in later 

25 Examples. 

1 . Characterization of the Ad5./?ga!.AF Genome 
To confirm that the vector genomes had the proper structures and that 
the fiber gene was absent from the Ad5.fcgal.AF chromosome, the DNA isolated 
from viral particles was analyzed. Briefly, purified viral DNA was obtained by 
30 adding 10 iA of 10 mg/ml proteinase K, 40 p\ of 0.5 M EDTA and 50 jj\ of 10% 
SDS to 800 fA of adenovirus-containing culture supernatant. The suspension 
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was then incubated at 55°C for 60 minutes. The solution was then extracted 
once with 

400 /A of a 24:1 mixture of chloroform:isoamyl alchohol. The aqueous phase 

was then removed and precipitated with sodium acetate/ethanol. The pellet was 

5 washed once with 70% ethanol and lightly dried. The pellet was then 

suspended in 40 //I of 10 mM Tris-HCI, pH 8.0, 1 mM-EOTA. Genomic DNA 

from Ad5.Bgal.wt and Ad5.Bgal.AF produced the expected restriction patterns 

following digestion with either EcoRI or with Ndel. Southern blotting, performed a 

with standard methods, with labeled fiber DNA as a probe demonstrated the 

10 presence of fiber sequence in Ad5.Bgal.wt but not in Ad5.Bgal.Af DNA. As a 

positive control, the blot was stripped and reprobed with labeled £4 sequence. 

Fiber and E4 sequences were detected by using labeled inserts from pCLF and 

pE4/Hygro, respectively. E4 signal was readily detectable in both genomes at 

equal intensities. The complete nucleotide sequence of Ad5.Sgal.AF is presented 

15 in SEQ ID NO: 23 and is contained in the virus particle on deposit with ATCC. 

2. Characterization of the Fiberless Adenovirus Ad5.£gal.AF 

To verify that Ad5.Bgal.AF was fiber-defective, 293 cells (which are 

permissive for growth of E1 -deleted Ad vectors but do not express fiber) were 

infected with Ad5.Bgal.AF or with Ad5.Bgal.wt. Twenty-four hours post 

20 infection, the cells were stained with polyclonal antibodies directed either against £ 

fiber or against the penton base protein. Cells infected with either virus were 

stained by the anti-penton base antibody, while only cells infected with the 

Ad5.Bgal.wt control virus reacted with the anti-fiber antibody. This confirms 

that the fiber-deleted Ad mutant does not direct the synthesis of fiber protein. 

25 3. Growth of the Fiber-Deleted Ad5./?gal.AF Vector in 

Complementing Cells 

Ad5.Bgal.AF was found to readily be propagated in 21 1 B cells. As 

assayed by protein concentration, CsCI-purified stocks of either Ad5.Bgal.AF or 

Ad5.Bgal.wt contained similar numbers of viral particles. The particles appeared 

30 to band normally on CsCI gradients. Infectivity of the Ad5.Bgal.AF particles was 

lower than the Ad5.Bgal.wt control, as indicated by an increased particle/PFU 

ratio. Ad5.Bgal.AF was also found to plaque more slowly than the control 
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virus. When plated on 21 1B cells, Ad5.fcgal.wt plaques appeared within 5-7 
days, while plaques of Ad5.Bgal.AF continued to appear until as much as 1*5-18 
days post infection. Despite their slower formation, the morphology of 
Ad5.dgal.AF plaques was essentially normal. 
5 4. Production of Fiberless Ad5.fcgal.AF Particles 

As Ad5.Bgal.AF represents a true fiber null mutation and its stocks are 
free of helper virus, the fiber mutant phenotype was readily investigated. A 
single round of growth in cells (such as 293) which do not produce fiber 
generating a homogeneous preparation of f iberless Ad allowed for the 

10 determination of whether such particles would be stable and/or infectious. 
Either Ad5.Bgal.wt or Ad5.Bgal.AF was grown in 293 or 21 1B cells, and the 
resulting particles purified on CsCI gradients as previously described. 
Ad5.Bgal.AF particles were readily produced in 293 cells at approximately the 
same level as the control virus and behaved similarly on the gradients, indicating 

15 that there was not a gross defect in morphogenesis of fiberless capsids. 

Particles of either virus contained similar amounts of penton base 
regardless of the cell type in which they were grown. This demonstrated that 
fiber is not required for assembly of the penton base complex into virions. The 
Ad5.Bgal.AF particles produced in 293 cells did not contain fiber protein. 

20 21 IB-grown Ad5.Bgal.AF also contained less fiber than the Ad5.Sgal.wt -control 
virus. The infectivities of the different viral preparations on epithelial cells 
correlated with the amount of fiber protein present. The fiberless Ad particles 
were several thousand-fold less infectious than the first-generation vector control 
on a per-particle basis, while infectivity of 21 IB-grown Ad5.Bgal.AF was only 

25 50-100 fold less than that of Ad5.Bgal.wt. These studies confirmed fiber's 

crucial role in infection of epithelial cells via CAR binding. 

5. Composition and Structure of the fiberless Ad5.ftgaU\F 
Particles 

The proteins contained in particles of 293-grown Ad5.Bgal.AF were 
30 compared to those in Ad5.Bgal.wt, to determine whether proteolysis or particle 
assembly was defective in this fiber null mutant. The overall pattern of proteins 
in the fiberless particles was observed to be quite similar to that of a 
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first-generation vector, with the exception of reduced intensity of the composite 

band resulting from proteins Ilia and IV (fiber). The fiberless particles also had a 

reduced level of protein VII. Although substantial amounts of uncleaved 

precursors to proteins VI, VII, and VIII were not seen, it is possible that the 

5 low-molecular weight bands migrating ahead of protein VII represent either 

aberrantly cleaved viral proteins or their breakdown products. 

Cryo-electron microscopy was used to more closely examine the structure 

of the 293 grown Ad5.Bgal.AF and of AdSBgal.wt. The fiber, having an 

extended stalk with a knob at the end, was faintly visible in favorable 

orientations of wild-type Ad5 particles, but not in images of the fiberless 

particles. Filamentous material likely corresponding to free viral DNA was seen 

in micrographs of fiberless particles. This material was also present in 

micrographs of the first-generation control virus, albeit at much lower levels. 

Three-dimensional image reconstructions of fiberless and wild-type 

particles at ~20 A resolution showed similar sizes and overall features, with the 

exception that fiberless particles lacked density corresponding to the fiber 

protein. The densities corresponding to other capsid proteins, including penton 

base and proteins Ilia, VI, and IX, were comparable in the two structures. This 

confirms that absence of fiber does not prevent assembly of these components 

into virions. The fiber was truncated in the wild-type structure as only the lower 

portion of its flexible shaft follows icosahedral symmetry* The RGO protrusions 

on the fiberless penton base were angled slightly inward relative to those of the 

wild-type structure. Another difference between the two penton base proteins 

was that there is a -30 A diameter depression in the fiberless penton base 

around the five-fold axis where the fiber would normally sit. The AdS 

reconstructions confirm that capsid assembly, including addition of penton base 

to the vertices, is able to proceed in the complete absence of fiber. 

6. Integrin-Dependent Infectivity of Fiberless AdS.Bgal.AF 
Particles 

While attachment via the viral fiber protein is a critical step in the 
infection of epithelial cells, an alternative pathway for infection of certain 
hematopoietic cells has been described. In this case, penton base mediates 
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binding to the cells (via 62 integrins) and internalization (through interaction with 
ovintegrins). Particles lacking fiber might therefore be expected to be 
competent for infection of these cells, even though on a per-particte basis they 
are several thousand-fold less infectious than normal Ad vectors on epithelial 
5 cells. 

To investigate this, THP-1 monocytic cells were infected with 
Ad5.Bgal.wt or with Ad5.Bgal.AF grown in the absence of fiber. Infection of 
THP-1 cells was assayed by infecting 2 x 10 5 cells at the indicated m.o.i. in 0.S 
ml of complete RPMI. Forty-eight hours post-infection, the cells were fixed with 

10 glutaraldehyde and stained with X-gal, and the percentage of stained cells was 
determined by light microscopy. The results of the infection assay showed that 
the fiberless particles were only a few-fold less infectious than first-generation 
Ad on THP-1 cells. Large differences were seen in plaquing efficiency on 
epithelial (21 1B) cells. Infection of THP-1 cells by either Ad5.Bgal.AF or 

15 Ad5.Bgal.wt was not blocked by an excess of soluble recombinant fiber protein, 
but could be inhibited by the addition of recombinant penton base). These 
results indicate that the fiberless Ad particles use a fiber-independent pathway to 
infect these cells. Furthermore, the lack of fiber protein did not prevent 
Ad5.BgalAF from internalizing into the cells and delivering its genome to the 

20 nucleus, demonstrating that fiberless particles are properly assembled and are 
capable of uncoating. 

The foregoing results with the recombinant viruses thus produced 
indicates that they can be used as gene delivery tools in cultured cells and in 
vivo as described more fully in the Examples. For example, for studies of the 

25 effectiveness and relative immunogenicity of multiply-deleted vectors, virus 

particles are produced by growth in the packaging lines described in €xample 1 
and are purified by CsCI gradient centrifugation. Following titering, virus 
particles are administered to mice via systemic or local injection or by aerosol 
delivery to lung. The LacZ reporter gene allows the number and type of cells 

30 which are successfully transduced to be evaluated. The duration of transgene 
expression is evaluated in order to determine the long-term effectiveness of 
treatment with multiply-deleted recombinant adenoviruses relative to the 
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standard technologies which have been used in clinical trials to date. The 

immune response to the improved vectors described here is determined by 

assessing parameters such as inflammation, production of cytotoxic T 

lymphocytes directed against the vector, and the nature and magnitude of the 

5 antibody response directed against viral proteins. 

Versions of the vectors which contain therapeutic genes such as CFPR 

for treatment of cystic fibrosis or tumor suppressor genes for cancer treatment 

are evaluated in the animal system for safety and efficiency of gene transfer and 

expression. Following this evaluation, they are used as experimental therapeutic 

10 agents in human clinical trials. 

B. Retargeting of Adenoviral Gene Delivery Vectors by Producing 
Viral Particles Containing Different or Altered Fiber Proteins 

As the specificity of adenovirus binding to target cells is largely 

determined by the fiber protein, viral particles that incorporate modified fiber 

15 proteins or fiber proteins from different adenoviral serotypes (pseudotyped 
vectors) have different specificities. Thus, the methods of expression of the 
native Ad5 fiber protein in adenovirus packaging cells as described above is also 
applicable to production of different fiber proteins. 

Chimeric fiber proteins can be produced according to kfiown methods 

20 (see, e.g., Stevenson eta/. (1995) J. Virol., 5S:28SO-2857). Determinants for 
fiber receptor binding activity are located in the head domain of the fiber and an 
isolated head domain is capable of trimerization and binding to cellular receptors. 
The head domains of adenovirus type 3 (Ad3) and Ad5 were exchanged in order 
to produce chimeric fiber proteins. Similar constructs for encoding chimeric fiber 

25 proteins for use in the methods herein are contemplated. Thus, instead of the 
using the intact Ad5 fiber-encoding construct prepared in above and in U.S. 
application Serial No. 09/482,682) as a complementing viral vector in adenoviral 
packaging cells, the constructs described herein are used to transfect -cells along 
with E4 and/or E1 -encoding constructs. 

30 Briefly, full-length Ad5 and Ad3 fiber genes were amplified from purified 

adenovirus genomic DNA as a template. The Ad5 and Ad3 nucleotide 
sequences are available with the respective taenBank Accession Numbers 
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M18369 and M1241 1. Oligonucleotide primers are designed to amplify the 
entire coding sequence of the full-length fiber genes, starting from the start 
codon, ATG, and ending with the termination codon TAA. For cloning purposes, 
the 5' and 3' primers contain the respective restriction sites BamHI and Notl for 
5 cloning into pcDNA plasmid as described in Example 1A. PCR is performed as 
described above. 

The resulting products are then used to construct chimeric fiber 
constructs by PCR gene overlap extension (Horton eta/. (1990) BioTechniques, 
8:525-535). The Ad5 fiber tail and shaft regions (5TS; the nucleotide region 

10 encoding amino acid residue positions 1 to 403) are connected to the Ad3 fiber 
head region (3H; the nucleotide region encoding amino acid residue positions 
136 to 319) to form the 5TS3H fiber chimera. Conversely, the Ad3 fiber tail and 
shaft regions (3TS; the nucleotide region encoding amino acid residues positions 
1 to 135) are connected to the Ad5 fiber head region (5H; the nucleotide region 

15 encoding the amino acid residue positions 404*to 581) to form the 3TSSH fiber 
chimera. The fusions are made at the conserved TLWT (SEQ ID NO: 19) 
sequence at the fiber shaft-head junction. 

The resultant chimeric fiber PCR products are then digested with BamHI 
and Notl for separate directional ligation into a similarly digested pcDNA 3.1. 

20 The TPL sequence is then subcloned into the BamHI as described in Example 1A 
for preparing an expression vector for subsequent transfection into 211 cells as 
described above or into the alternative packaging cell systems as previously 
described. The resultant chimeric fiber construct-containing adenoviral 
packaging cell lines are then used to complement adenoviral delivery vectors as 

25 previously described. Other fiber chimeric constructs are obtained with the 
various adenovirus serotypes using a similar approach. 

In an alternative embodiment, the use of modified proteins including with 
modified epitopes {see, e.g., Michael eta/. (1995) Gene Therapy, 2:t360^668 
and International PCT application Publication No. WO 95/2641 2, which describe 

30 the construction of a cell-type specific therapeutic viral vector having a new 

binding specificity incorporated into the virus concurrent with the destruction of 
the endogenous viral binding specificity). In particular, the authors described the 




PCT/EP01/04863 



-71- 

production of an adenoviral vector encoding a gastrin releasing peptide «3RP) at 
the 3' end of the coding sequence of the Ad5 fiber gene. The resulting fiber- 
GRP fusion protein was expressed and shown to assemble functional fiber 
trimers that were correctly transported to the. nucleus of HeLacelte following 
5 synthesis. 

Similar constructs are contemplated for use in the complementing 
adenoviral packaging cell systems for generating new adenoviral gene delivery 
vectors that are targetable, replication-deficient and less immunogenic. 
Heterologous ligands contemplated for use herein to redirect fiber specificity 

10 range from as few as 10 amino acids in size to large globular structures, some of 
which necessitate the addition of a spacer region so as to reduce or preclude 
steric hindrance of the heterologous ligand with the fiber or prevent trimerization 
of the fiber protein. The ligands are inserted at the end or within the linker 
region. Preferred ligands include those that target specific cell receptors or 

15 those that are used for coupling to other moieties such as biotin and avidin. 
A preferred spacer includes a short 12 amino acid peptide linker 
composed of a series of serines and alanine flanked by a proline residue at each 
end using routine procedures known to those of skill in the art. The skilled 
artisan will be with the preparation of linkers to accomplish sufficient protein 

20 presentation and to alter the binding specificity of the fiber protein without 
compromising the cellular events that follow viral internalization. Moreover, 
within the context of this disclosure, preparation of modified fibers having 
ligands positioned internally within the fiber protein and at the carboxy terminus 
as described below are contemplated for use with the methods described herein. 

25 

The preparation of a fiber having a heterologous binding ligand is prepared 
essentially as described in the above-cited paper. Briefly, for the ligand of 
choice, site-directed mutagenesis is used to insert the coding sequence for a 
linker into the 3 f end of the Ad5 fiber construct in pCLF as prepared in Example 
30 1. 

The 3' or antisense or mutagenic oligonucleotide encodes a preferred 
linker sequence of ProSerAlaSerAlaSerAlaSerAlaProGlySer 1S£Q tD NO: 20) 
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followed by a unique restriction site and two stop codons, respectively, to allow 
the insertion of a coding sequence for a selected heterologous ligand and to 
ensure proper translation termination. Flanking this linker sequence, the 
mutagenic oligonucloetide contains sequences that overlap with the vector 
5 sequence and allow its incorporation into the construct. Following mutagenesis 
of the pCLF sequence adding the linker and stop codon sequences, a nucleotide 
sequence encoding a preselected iigand is obtained, linkers corresponding to the 
unique restriction site in the modified construct are attached and then the 
sequence is cloned into linearized corresponding restriction site. The 

10 resultant fiber-ligand construct is then used to transfect 21 1 or the alternative 
cell packaging systems previously described to produce complementing viral 
vector packaging systems. 

In a further embodiment, intact fiber genes from different Ad serotypes 
are expressed by 21 1 cells or an alternative packaging system as previously 

15 described. A gene encoding the fiber protein of interest is first cloned to create 
a plasmid analogous to pCLF, and stable cell lines producing the fiber protein are 
generated as described above for AdS fiber. The adenovirus vector described 
which Jacks the fiber gene is then propagated in the cell line producing the fiber 
protein relevant for the purpose at hand. As the only fiber gene present is the 

20 one in the packaging cells, the adenoviruses produced contain only the fiber 
protein of interest and therefore have the binding specificity conferred by the 
complementing protein. Such viral particles are used in studies such as those 
described above to determine their properties in experimental animal systems. 

EXAMPLE 3 

25 Tripartite leader sequences (TPLs) that are useful in enhancing the 

expression of complementing adenoviral proteins, particularly fiber protein, for 
use in preparing an adenoviral gene delivery vector are provided. The complete 
Ad5 TPL was constructed by assembling PCR fragments, first, the third TPL 
exon (exon 3) (nt 9644-9731 of the AdS genome) was amplified from AdS 

30 genomic DNA using the synthetic oligonucleotide primers 

5XTCAACAATTGTGGATCCGTACTCC3MSEQ ID No. -24) and 
5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID No. 25). The resulting 
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product was cloned to the BamHI and Bglll sites of pAE1Sp1a (Microbix 
Biosystems) using sites in the primers (shown in bold) to create plasmid pDV52. 
A fragment corresponding to the first TPL exon (exon 1), the natural first intron 
(intron 1), and the second TPL exon (exon 2) (Ad5 nt 6049-71.82) was then 
5 amplified using primers 5'GGCGCGTTCGGATCCACTCTCTTCC3' (S€Q ID No. 
26) and 5XTACATGCTA<3GCAGATCTCGTTCGGAG3' (SEQ ID No. 27),and 
cloned into the BamHI site of pDV52 (again using sites in the primers) to create 
pDV55. 

This plasmid contains a 1.2 kb BamHI/Bglll fragment containing the first 
10 TPL exon, the natural first intron, and the fused second and third TPL exons. 
The nucleotide sequence of the complete TPL containing the noted 5' and 3' 
restriction sites is shown in SEQ ID No 28 with the following nucleotide regions 
identified: 1-6 nt BamHI site; 7-47 nt first leader segment (exon 1); 48-1068 nt 
natural first intron (intron 1); 1069-1 140 nt second leader segment (exon 2); 
5 1141-1146 nt fused BamHI and Bglll sites; 1 147-1234 nt third leader segment 
(exon 3); and 1235-1240 nt Bglll site. 

EXAMPLE 4 

Deposit of Materials 

The following cell lines and plasmids were deposited on September 25, 
1996, with the American Type Culture Collection, 10801 University Blvd, 
Manassas, Virginia, USA (ATCC) under the provisions of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure and the Regulations thereunder (Budapest Treaty): Plasmid 
pE4/Hygro (accession number 97739), Plasmid pCLF (accession number 97737), 
211 Cell Line (accession number CRL-12193) and 211 A Cell Line (accession 
number CRL-12194) 

The following virus, Ad5.£gal.AF, was deposited on January 15, 1999, 
with the ATCC as listed above and provided with accession number VR2636. 

Additionally, plasmids pDV60, pDV67, pDV69, pDV80 and pDV90 were 
deposited at the ATCC on January 5, 2000 and provided with accession 
numbers PTA-1 144, PTA-1 145, PTA-1 146, PTA-1 147 and PTA-1 148 
respectively. 
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EXAMPLE 5 

Preparation and Use of Adenoviral Packaging Ceil Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids containing tripartite leaders (TPLs) have been constructed. The 

5 resulting plasmids that contain different selectable markers, such as neomycin 

and zeocin, were then used to prepare fiber-complementing stable cell lines for 

use as for preparing adenoviral vectors. 

A. pDV60 

Plasmid pDV60 was constructed by inserting this TPL -cassette of SEQ ID 
10 No. 28 into the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a 
neomycin selectable plasmid (see, e.g., U.S. application Serial No. 09/482,682 
(also filed as International PCT application No. PCT/US00/00265 on January 14, 
2000); see also Von Seggern et al. 

(1998) J. Gen Virol., 79: 1461-1468). The nucleotide sequence of pDV60 is 
15 listed in SEQ ID NO: 29. Plasmid pDV60 has been deposited in the ATCC under 
accession number PTA-1 144. 

B. pDV61 

To construct pDV61, an Asp718/Notl fragment containing the CMV 
promoter, partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine growth hormone 

20 terminator was transferred from pCLF (ATCC accession number 97737; and 
described in copending U.S. application Serial No. 09/482,682 (also filed as 
International PCT application No. PCT/USOO/00265 on January 14, 2000);), to a 
zeocin selectable cloning vector referred to as pCDNA3.1/Zeo ( + ) (commerically 
available from Invitrogen and for which the sequence is known). 

25 C. pDV67 

In an analogous process, pDV67 containing complete TPL was 
constructed by transferring an Asp 718/Xbal fragment from pDV60 into 
pcDNA3.1/Zeo( + ) backbone. The nucleotide sequence of pDV67 is set forth in 
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SEQ ID No. 30. Plasmid pDV67 is available from the ATCC under accession 
number PTA-1 145. 

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric 
5 Ad3/Ad5 fiber gene was amplified from pGEM5TS3H (Stevenson -ef a/. <1995) J. 
Virol., 69: 2850-2857) using the primers 5'ATGGGAT 
CAAGATGAAGCGCGCAAGACCG3' {SEQ ID No. 31) and 

5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID No. 32), and cloned to 
the BamHI and Notl sites of pcDNA3.1/Zeo( + ) via new BamHI and Notl sites 
engineered into the primers to create pDV68. Finally, the complete TPL 
fragment described-above was then added to the unique BamHI site of pDV68 
to create pDV69. The nucleotide sequence of pDV69 is listed in SEQ ID No. 33 
and has been deposited in the ATCC under accession number PTA-1 146. 
E. Preparation of Stable Adenovirus Packaging Cell Lines 
E1-2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC # 
CCL 185) with chromosomal insertions of the plasmids pGR£5-2.€1 (also 
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 34) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 35), which provide complementation of the adenoviral E1 and E2a 
functions, respectively. This line and its derivatives were grown in Richter's 
modified medium (BioWhitaker) + 10% FCS. E1-2a S8 cells were 
electroporated as previously described (Von Seggern eta/. (1998) J. Gen Virol., 
79: 1461-1468) with pDV61, pDV67, or with pDV69, and stable lines were 
selected with zeocin (600 /yg/ml). 

The cell line generated with pDV61 is designated 601. The cell line 
generated with pDV67 is designated 633 while that generated with pDV69 is 
designated 644. Candidate clones were evaluated by immunofluorescent 
staining with a polyclonal antibody raised against the Ad2 fiber. Lines 
expressing the highest level of fiber protein Were further characterized. 

For the S8 cell complementing cell lines, to induce E1 expression, 0.3 yt/M 
of dexamethasone was added to cell cultures 16-24 hours prior to challenge 
with virus for optimal growth kinetics. For preparing viral plaques, *5 X 10 s 



WOHP729 



PCT/EP01/04863 



-76- 

cells/well in 6 well plates are prepared and pre-induced with the same 
concentration of dexamethasone the day prior to infection with 0.15 pM included 
at a final concentration in the agar overlay after infection. 
F. Development of Cell Lines for Complementation of E17E2a' Vectors 
5 The Adenovirus 5 genome was digested with Seal enzyme, -separated on 

an agarose gel, and the 6,095 bp fragment containing the left end of the virus 
genome was isolated. The complete Adenovirus 5 genome is registered as 
Genbank accession #M73260, incorporated herein by reference/ and the virus is 
available from the American Type Culture Collection, Manassas, Virginia, U.S.A., 

10 under accession number VR-5. The Seal 6,095 bp fragment was digested 

further with Clal at bp 917 and Bglll at bp 3,328. The resulting 2,41 1 bp Clal to 
Bglll fragment was purified from an agarose gel and ligated into the superlinker 
shuttle plasmid pSE280 (Invitrogen, San Diego, CA), which was digested with 
Clal and Bglll, to form pSE280-E. 

15 Polymerase chain reaction (PCR) was performed to synthesize DNA 

encoding an Xhol and Sail restriction site contiguous with Adenovirus 5 DNA bp 
552 through 924. The primers which were employed were as follows: 
5' end, Ad5 bp 552-585: 

5'-GTCACTCGAGGACTCGGTC-X3ACTGAAAATGAGACATATTATCTGCCACGGA 
20 CC-3 f (SEQ ID No 36) 
3' end, Ad5 bp 922-891: 

5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' {SEQ ID No. 37) 

This amplified DNA fragment (sometimes hereinafter referred to as 
Fragment A) then was digested with Xhol and Clal, which cleaves at the native 
25 Clal site (bp 917), and ligated to the Xhol and Clal sites of pS£280-t, thus 

reconstituting the 5' end of the E1 region beginning 8 bp upstream of the ATG 
codon. 

PCR then was performed to amplify Adenovirus 5 DNA from bp 3,323 
through 4,090 contiguous with an EcoRI restriction site. The primers which 
30 were employed were as follows: 
5 r end, Ad5 bp 3323-3360: 
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5M^ATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID No. 38); 
and 

3' end, Ad5 bp 4090-4060: 

5'-GCGACTTAAGCAGTCAGCTG-AGACAGCAAGACACTTGCTTGATCCAAAT€C 
5 -3' (SEQ ID No. 39). 

This amplified DNA fragment (sometimes hereinafter referred to as 
Fragment B) was digested with Bglll, thereby cutting at the Adenovirus 5 Bglll 
site (bp 3,382) and EcoRI, and ligated to the Bglll and EcoRI sites of pS£280-A£ 
to reconstruct the complete E1a and E1b region from Adenovirus 5 bp 552 
10 through 4,090. The resulting plasmid is designated pSE280-E1. 

A construct containing the intact E1a/b region under the control of the 
synthetic promoter GRE5 was prepared as follows. The intact €1a/b region was 
excised from pSE280-E1, which was modified previously to contain a BamHI site 
3' to the E1 gene, by digesting with Xhol and BamHI. The Xhol to BamHI 
15 fragment containing the E1a/b fragment was cloned into the unique Xhol and 
BamHI sites of pGRE5-2/EBV (U.S. Biochemicals, Cleveland, Ohio) to form 
pGRE5-E1 ), 

Bacterial transformants containing the final construct were identified. 
Plasmid DNA was prepared and purified by banding in CsTFA prior to use for 

20 transfection of cells. 

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and Spel, which cut 
at bp 21,562 and 27,080, respectively. Fragments were separated on an 
agarose gel and the 5,518 bp BamHI to Spel fragment was isolated. The 5,518 

25 bp BamHI to Spel fragment was digested further with Smal, which cuts at bp 
23,912. The resulting 2,350 bp BamHI to Smal fragment was purified from an 
agarose gel, and ligated into the superlinker shuttle plasmid pSE280, and 
digested with BamHI and Smal to form pSE280-E2 BamHi-Smal. 

PGR then was performed to amplify Adenovirus 5 DNA from the Smal site 

30 at bp 23,912 through 24,730 contiguous with Nhel and fcofll restriction ^ites. 
The primers which were employed were as follows: 
5' end, Ad5 bp 24,732-24,708: 
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5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID No. 40); 
3' end, Ad5 bp 23,912-23,934: 

5'-CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3' (SEQ ID No. 41) 

This amplified DNA fragment was digested with Smal and EcoRI, and 
5 ligated to the Smal and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the 
complete E2a region from Ad5 bp 24,730 through 21 ,562. The resulting 
construct is pSE280-E2a. 

In order to convert the BamHI site at the 3' end of E2a to a Sail site, the 
E2a region was excised from pSE280-E2a by cutting with BamHI and Nhel, and 

10 recloned into the unique BamHI and Nhel sites of pSE280. Subsequently, the 
E2a region was excised from this construction with Nhel and Sail in order to 
clone into the Nhel and Sail sites of the pMAMneo (Clonetech, Palo Alto, CA) 
multiple cloning site in a 5' to 3' orientation, respectively. The resulting 
construct is pMAMneo E2a. 

1 5 Bacterial transf ormants containing the final pMAMneo-E2a were 

identified. Plasmid DNA was prepared and purified by banding in CsTFA. 
Circular plasmid DNA was linearized at the Xmnl site within the ampicillin 
resistance gene of pMAMneo-E2a, and further purified by the phenol/chloroform 
extraction and ethanol precipitation prior to use for transfection of cells. 

20 Transfection and selection of cells. 

In general, this process involved the sequential introduction, by -calcium 
phosphate precipitation, or other means of DNA delivery, of two plasmid 
constructions each with a different viral gene, into a single tissue culture cell. 
The cells were transf ected with a first construct and selected for expression of 

25 the associated drug resistance gene to establish stable integrants. Individual cell 
clones were established and assayed for function of the introduced viral gene. 
Appropriate candidate clones then were transfected with a second construct 
including a second viral gene and a second selectable marker. Transfected cells 
then were selected to establish stable integrants of the second construct, and 

30 cell clones were established. Cell clones were assayed for functional expression 
of both viral genes. 
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A549 (ATCC Accession No, CCL-185) were used for transfection. 
Appropriate selection conditions were established for G418 and hygromycin Bby 
standard kill curve determination. 

Transfection of A549 cells with plasmids including E1 and E2a regions. 

pMAMNeo-E2a was linearized with Xmnl with the Amp* gene, introduced 
into cells by transfection, and cells were selected for stable integration of this 
plasmid by G418 selection until drug resistant colonies arose. The clones were 
isolated and screened for E2a expression by staining for E2a protein with a 
polyclonal antiserum, and visualizing by immunofluorescence. E2a function was 
screened by complementation of the temperature-sensitive mutant Ad5ts125 
virus which contains a temperature-sensitive mutation in the E2a gene. (Van Der 
Vliet, et al., J. Virology, Vol. 15, pgs. 348-354 (1975)). Positive clones 
expressing the E2a gene were identified and used for transfection with the 7 kb 
EcoRV to Xmnl fragment from pGR£5-E1, which contains the GRE5 promoted 
E1a/b region plus the hygromycin R gene. Cells were selected for hygromycin 
resistance and assayed for E1a/b expression by staining with a monoclonal 
antibody for the E1 protein (Oncogene Sciences, Uniondale, N.Y.). E1 function 
was assayed by ability to complement an E1 -deleted vector. At this point, 
expression and function of E2a was verified as described above, thus 
establishing the expression of E1a/b and E2a in the positive cell clones. 

A transfected A549 ( A549 (ATCC Accession No. CCL-1 85);) cell 
lines showed good E1a/b and €2a expression and was selected for further 
characterization. It was designated the S8 cell line. 

G. Preparation of Adenoviral Vectors Containing Ad5./?gal.AF Genome in SB 
improved Fiber-Complementing Cell Lines 

To prepare adenoviral vectors containing Ad5.£gal.AF (AdS.j9gal.AF has 
been was deposited the ATCC under accession number VR2636) in S8 cells 
containing alternative forms of TPL for enhancing the expression of fiber 
proteins, the protocol as described in Example 2 for preparing Ad5.£gal.AF in 
21 1B cells was followed with the exception of pretreatment with 0.3 jjM 
dexamethasone for 24 hours as described above. Thus, viral particles with the 
wildtype Ad5 fiber protein on their surface and containing the fiberless 
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Ad5.jffgal.AF genome were produced in 633 cells. Particles produced in *644 
cells also contained the fiberless Ad5./?gaI.AF genome, but had the chimeric 
5T3H fiber protein, with the Ad3 fiber knob, on their surface. 

Thus, these viral preparations, prepared as described herein are useful for 
5 targeting delivery of the Ad5.yffgal.AF, Ad5.GFP.AF, or other similarly 

constructed fiberless genome with either wild-type or modified fibers. Preferably 
for purposes herein the fibers are from an Ad serotype D virus, more preferably 
from Ad37. 

EXAMPLE 6 

10 Pseudotyping and Infectivity of Recombinant Adenoviral Vectors 

Produced with Improved fiber-Complementing Cell Lines 

A. Pseudotyping of Ad5.jffgal.AF 

To verify that adenoviral vectors were produced had altered tropisms, 
viral particles were purified from either 633 or 644 cells and were then Western 
1 5 blotted and probed with a polyclonal rabbit antibody against the Ad2 fiber 
(which detects the Ad5 and chimeric 5T3H fiber proteins). 

B. Infectivity of Ceils with 633 or 644 Generated Virus Particles The 
cell lines, 633 or 644, prepared as described above, were infected with the 
indicated number of particles/cell of Ad5.£gal.AF and virus particles produced. 

20 Virus was then used to infect selected cell lines, including 21 IB, MRC-S human 
fibroblasts, A-10 rat aortic endothelial cells, and THP-1 human monocytic cells. 
Unbound virus was removed by washing the cells and the cells were further 
incubated at 37 °C for 48 hours. Cells were then fixed with glutaraldehyde and 
stained with X-gal. The percentage of stained cells was then determined by light 

25 microscopy where all experiments were done in triplicate. 

The results indicated that adenoviral vectors could be retargeted by 
pseudotyping using packaging cell lines expressing different fiber proteins. 
Particles containing either fiber were equally infectious on 21 IB cells, while 
MRC-5 fibroblasts and THP-1 cells were more readily infected by virus containing 

30 the chimeric fiber. The A-10 rat endothelial cells were more readily infected by 
particles containing the wildtype Ad5 fiber protein. 
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EXAMPLE 7 

Transient Transcomplementation 

The ability of adenovirus type 5 (Ad5) to deliver therapeutic genes to cells 
is mediated by the interaction of the adenoviral fiber protein with the 
5 coxsackievirus-adenoviral receptor (CAR). Because a wide-range of cells express 
CAR, it was thought that it would be difficult to use adenoviruses to deliver 
genes to specific cell types. A system for testing modified fiber genes to 
identify tropisms of interest is described in copending U.S. application Serial 
No. 09/482,682 (also filed as International PCT application No. 9 
10 PCT/US00/00265 on January 14, 2000). An in vitro system has been 

developed that involves infection of tissue culture cells with a fiber-deleted Ad 
and transient co-transfection with a plasmid directing fiber expression. This 
system allows one to produce and evaluate modified fibers expressed on a viral 
particle. This system can be used to produce therapeutic quantities of 
15 adenoviral vectors with modified fiber proteins, with such fibers having a new 
tropism added by insertion of a desired ligand into the fiber gene. These fibers 
may also have the natural tropism {i.e. binding to CAR) ablated. 

Plasmids used were pDV60 and pDV55 were prepared as described 
herein and in U.S. application Serial No. 09/482,682 (also filed as International 
20 PCT application No. PCT/US00/00265 on January 14, 2000). pDVCO is an a 
pcDNA3.1 -based expression plasmid that contains theCMV promoter, Ad5 
tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains 
no fiber gene and serves as the negative control. Ad5.0gal.AF and 21 1B are 
described above. 293T cells are identical to 293 cells except they express an 
25 integrated SV40 large T antigen gene. HDF cells are human diploid fibroblasts. 
293T cells express CAR and a v integrins; HDF cells express a v integrins but no 
CAR. Transfections with fiber expression plasmids were performed with 
Lipofectamine (GIBCO-BRL) using 20mg DNA and 50ml Lipofectamine per 1Scm 
dish. Cells were maintained in DMEM supplemented with 10% fetal bovine 
30 serum. 

The fiber deletion mutation of Ad5.£gal.AF is complemented in trans by 
passaging virions through 21 1 B, a cell line that stably expresses functional Ad5 
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fiber. The present system was designed to complement Ad5.£gaLAF by 
modified fibers expressed from transfected episomal plasmkJs in 293T -cells. The 
result is a simplified and rapid method to incorporate modified fibers on a viral 
particle containing the Ad5,/?gal.AF genome that does not require propagation of 



The feasibility of transcomplementation of Ad5./?gal.AF with episomal 
fiber-expressing plasmids was demonstrated in the following experiment. 293T 
cells were transfected with one of two plasmids: pDV55, which expresses no 
fiber or pDV60, which expresses wildtype Ad5 fiber. Fiber expression persists 
for at least six days. Twenty-four hours after transfection, these cells were 
infected at 2000 particles/cell with Ad5.£gal.AF passaged through 21 1B cells. 
Seventy-two hours later, a crude viral lysate (CVL) was generated by exposing 
the cells to five freeze-thaw cycles. Viral particles were purified by cesium 
chloride gradient centrifugation. The resulting virions incorporated the fiber 
expressed from the episomal plasmid, as confirmed by Western blots performed 
with an antibody specific to the Ad5 fiber. 

Episomal plasmid transcomplementation system is suitable for quickly 
expressing and evaluating the properties of modified fibers in the context of a 
viral particle. Episomal plasmid transcomplementation will also be of great utility 
for quickly evaluating a bank of modified fibers for other binding properties, 
including new tropisms and the ablation of the native tropism. In addition to the 
rapid generation and testing of large numbers of modified fibers, there are other 
advantages to the Ad5./?gal.AF transcomplementation system in terms of 
production and safety. Episomal plasmid transcomplementation has the inherent 
advantage over transcomplementation in that it is not necessary to make a 
stable cell line for every modified fiber for complementation with Ad5.£gal.AF. 
Because the Ad5.£gal.AF is deleted in E1, E3 and fiber, there is an additional 
gene deletion, which should render it very suitable for gene therapy. In addition, 
the presence of the fiber gene deletion decreases the opportunity to generate 
replication-competent virus via recombination in the packaging cells. A single 
Ad vector preparation can be retargeted to any number of different cell types 
simply by transfecting the cells with the appropriate fiber^expcession construct. 



5 



the virus. 
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EXAMPLE 8 

Preparation of Adenoviral Gene Delivery Vectors Containing the Ad37 fiber 
protein 

This example describes construction of packaging cell lines expressing 
5 the Ad37 fiber protein, and their use in generating particles of a fiber-deleted Ad 
vector (such as Ad5.£gal.AF) containing this fiber protein. The fiber protein is 
attached to the viral capsid by binding to the penton base protein through its N- 
terminus, and the Ad37 fiber was modified in order to make its N-terminal 
sequence more closely match that of the Ad5 protein to ensure that it would 
10 efficiently bind the Ad5 penton base in these vectors. 

A. Materials and methods 

Cell lines and wild-type adenovirus. Human A549 lung carcinoma 
epithelial cells and human Chang C conjunctival cells (American Type Culture 
Collection) were maintained in complete Dulbecco's Modified Eagle Medium 

15 (DMEM) with 10% fetal bovine serum. Wild-type Ad19p and Ad37 (ATCC) were 
propagated in A549 cells and purified by banding on CsCI 2 density gradients as 
previously described (Huang eta/. (1999) J. Virol. 75:2798-2802). Viral protein 
concentration was determined by the Bio-Rad Protein Assay, and was used to 
calculate the number of viral particles based on the known molecular weight of 

20 Ad2 virions (1 //g = 4 x 10 9 particles). 

B. Construction of the Ad37 fiber expressing cell lines and the recombinant 
Ad37 knob protein. 

1 . Construction of an Expression Plasmid for the Ad37 Fiber Protein 
(pDV80) 

25 The plasmid designated pDV80 (see, SEQ ID No. 42) prepared for 

expression of the Ad37 fiber protein in mammalian cells, uses the same 
regulatory elements as the elements in pDV60, pDV67, and pDV69 to express 
the Ad37 fiber in packaging lines. It was constructed in two steps. 

First, the Ad37 fiber open reading frame was amplified from Ad37 

30 genomic DNA using synthetic oligonucleotide primers, L37: 5' TGT^CCT GGA 
TCC AAG A TG AAG CGC GCC CGC C CC AGC GAA GAT GAC TTC 3'{SEQ ID 
NO. 43) and 37FR: 5' AAA CAC GGC GGC CGC TCT TTC ATTCTT G 3"SEQ 
ID NO. 44). L37 contains nucleotides (underlined) that differ from the Ad37 
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genomic sequence in order to add a unique fla/nHI site (bold) before the start 
codon (italicized) and to create point mutations that make the N-terminal 
sequence of the fiber more closely match the N-terminal sequence of the Ad 5 
fiber protein as follows: 
5 Ad37 MSKRLRVEDDFNPVYPY (SEQ ID No. 45) 

KRARPS (SEQ ID No. 46) 

Ad5 M KRARPS EDTFNPVYPY (SEQ ID No. 47) . 

37FR also incorporates a unique M>f1 site (bold). The PCR product was inserted 

10 into the BamHI and Afofl sites of pCDNA3.1zeo{ + ) flnvitrogen) to create 
pDV78. The correct sequence of the Ad37 fiber protein, including inserted 
changes, was confirmed by sequencing. 

Two point mutations in the fiber gene in the 705 line, S356 to P356 and 
I362 to T362, were discovered by the sequencing. The mutations are not in the 

15 receptor binding domain in Ad37 fiber gene in the 705 cell line. They are buried 
in the knob trimer interface. To confirm that the these mutations do not affect 
receptor binding, the Ad37 fiber protein with the correct sequence was recloned, 
and 293T cells transfected with the virus and subsequently infected with 
Ad5.GFPAF to produce Ad37 pseudotyped virus. The results were the same as 

20 the results of the experiments with Ad37 pseudotyped virus produced from line 
705 (see, Wu eta/. (2001) Virology 273:78-89). 

Second, a 1.2 kb Bam WWBgl II fragment containing an adenovirus type 5 
tripartite leader was excised from pDV55 (see EXAMPLE 3) and inserted into the 
Bam H1 site of pDV78 to create pDV80 (SEQ ID No 42). Plasmid pDV80 has 

25 been deposited in the ATCC under accession number PTA-1 147. 

2. Construction of the recombinant Ad37 knob protein 
Recombinant Ad37 knob protein containing an N-terminal T7»Tag was 
produced in £ co// using the PET expression system {Novagen). Ad37 fiber f)NA 
(GenBank accession number U69132) was PCR amplified from wild-type Ad37 

30 genomic DNA using the following primers (SEQ ID Nos. 48 and 49): 
5' GGATCC 4 TGGGATACTTGGTAGCA 3' {BamH\ site underlined and 
5' GCAA CTCGAGT CATTCTTGGGCAATATAGG 3'<Xhol site underlined). 
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The PCR reactions were performed at 94 °C (denaturation), 55 °C 
(annealing), 72 °C (extension, 30 cycles) using Taq DNA polymerase (Qiagen). 
The amplified DNA fragments, which contained residues 1 72 to 365 of the Ad37 
fiber protein with the addition of an N-terminal start, codon (italicized), were 
purified and subcloned into the pCR-TOPO vector using the TA-Cloning Kit 
(Invitrogen). No replication errors were found by DNA sequencing. Plasmids 
from cultured transformed colonies were purified and digested with BamH\ and 
Xho\. The fragment was inserted into the BamHl and Xhol sites of the bacterial 
expression vector, pET21a (Novagen), and transformed into (D£3)pLYS S 
expression cells (Invitrogen). Colonies were selected for knob expression by 
induction with 1 mM IPTG for four hours at 37 °C and knob expression was 
determined by SDS-PAGE. The colony displaying highest knob expression was 
used for large-scale knob expression and induced with 0.5 mM IPTG at 30 °C for 
four hours. 

The recombinant T7»Tagged Ad37 knob protein was purified from 
sonicated bacteria using the T7«Tag Affinity Purification Kit as recommended by 
the manufacturer (Novagen). Recovered protein was analyzed for purity by SDS- 
PAGE followed by Coomassie staining or Western blotting with an HRP- 
conjugated ar-T7»Tag monoclonal antibody as described by the manufacturer 
(Novagen) or an a-Ad37 fiber rabbit antibody. 

3. Preparation of Cell Lines that Express the Ad37 fiber protein 

Plasmid pDV80 DNA was purified using the Qiagen method and 
electroporated into the adenovirus-complementing cell line €1-2aS8 (see 
Examples herein; see also, Gorziglia et al. (1996) J. Virology 70:4173-4178; 
and Von Seggern et at. (1998) J. Gen. Virol. 73:1461-1468). ^Stable olones were 
selected with 600//g/ml zeocin (Invitrogen). 

Clones were expanded and were screened for fiber expression by indirect 
immunofluorescence (Von Seggern et al. (1998) J.Gen. Virol. 73:1461-1468) 
using a rabbit polyclonal antibody directed against the Ad37 fiber (a-Ad37 fiber 
rabbit antibody) raised by immunizing rabbits with recombinant Ad37 fiber 
protein. Two clones (lines 705 and 731) that expressed the protein at a 
uniformly high level were selected. 
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EXAMPLE 9 

Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37 
fiber protein, the Ad37 fiber-expressing 705 cells were infected (approximately 
5 1000 particles/cell) with Ad5.0gal.AF or with Ad5.GFP.AF. 
. Materials and methods 

Ad5./?gal.AF 

The construction of Ad5./?gaI.AF is described in Example 2 (it has been 
deposited on January 15, 1999, with the ATCC as listed above under accession 
10 number VR2636; see also, Von Seggern etaf. (1999) J.Virol. 73:1601-1608; 
copending U.S. application Serial No. 09/482,682 filed January 14, 2000, and 
also International PCT application No. PCT/USOO/00265, filed January 14, 
2000). 

Ad5.GFP.AF 

15 Ad5.GFP.AF was constructed by recombination in bacteria using a 

modification of the AdEasy System (see, U.S. Patent No. 5,922,576; see, also 
He etal. (1998) Proc. Nat/. Acad. Sci. U.S.A. 35:2509-2514; the system is 
publicly available from the authors and other sources). 

First, a fiber-deleted genomic plasmid was constructed by removing the 

20 fiber gene from pAdEasy-1 (see, U.S. Patent No. 5,922,576; and He etal. 
(1998) Proc. Natl. Acad. Sci. U.S.A. 35:2509-2514; the AdEasy system and 
vectors are publicly available from He et al. at Johns Hopkins University). 
Plasmid pAdEasy-1 contains the entire Ad5 genome, except for nucleotides 1- 
3,533, which encompass the E1 genes, and nucleotides 28„ 130-30,820, which 

25 encompass the E3 gene. 

Plasmid pDV43 (see Example 2; see, also Von Seggern etal. (1999) 
J.Virol. 73:1601-1608) was digested with Pad, the ends blunted by treatment 
with the large fragment of £ co/i DNA polymerase and dNTPs, and the product 
re-Jigated to produce plasmd pDV76. The resulting plasmid pDV76 is identical to 

30 pDV43 except for loss of the Pac\ site and contains the right -end of the Ad5 
genome with E3 and fiber deletions. A 4.23 kb fragment from PDV76 was 
amplified using the oligonucleotide primers (SEQ ID Nos.50 and 51: 
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5' CGC GCT GAC TCT TAA GGA CTA GTT TC 3', including the unique Spe1 
site in the Ad5 genome (bold); and 5' <5CG CTT AAT TAA CAT CAT CAA TAA 
TAT ACC TTA TTT T 3', including a new Pac\ site (bold) adjacent to the right 
Ad5 ITR. Hence the resulting PCR amplified fragment contains nucleotides 
5 27,082 to 35,935 of the Ad5 genome with deletions of nucleotides 28,1 33 *o 
32,743 (the E3 and fiber genes), and was used to replace the corresponding 
SpeMPad fragment of pAdEasy 1 (see, U.S. Patent No. 5,922,576) to -create 
pDV77. 

Second, £ coli strain BJ5183 was electroporated with a mixture of 
10 pDV77 and Pmel -linearized pAdTrack as described (U.S. Patent No. 5,922,576; 
He eta/. (1998) Proc. Natl. Acad. Sci. U.S.A. 35:2509-2514), and DNA was 
isolated from kanamycin-resistant colonies. The resulting plasmid, pDV83, 
contains a complete Ad5 genome with E1-, E3-, and fiber-deletions with a CMV- 
driven GFP reporter gene inserted at the site of the €1 deletion. The full length 
Ad chromosome was isolated by Pad digestion, and transfected into the -El- 
and fiber-complementing 633 cells (Von Seggern etaL (2000) J.Virol. 74:354- 
362). The 633 cells were produced by electroporating pDV67 (SEQ ID No. 30, 
deposited under ATCC accession number PTA-1 145) into the €1-2a S8 cells, 
described above. The recovered virus Ad5:T3FP.AF was then plaque purified t>y 
plating on 633 cells and virus stocks were prepared by freeze-thawing cell 
pellets. 

Ad5-pseudotype particle production 
Particles with Ad5 fiber 

Ad5-pseudotyped particles were generated by virus growth in 633 ceRs, 
which express the wild type Ad5 fiber protein. Viral particles were isolated and 
purified over CsCI gradients (Von Seggern era/. (1999) J. Virol. 73: T601-1 608; 
purified by centrifugation on preformed 1 5-40% CsCI gradients <1 1 1 ,000 x g for 
three hours at 4°Q). For analysis of viral proteins, ten //g of the purified 
particles were electrophoresed on 8-16% gradient gels and the protein 
transferred to nylon membranes. The resulting blot was probed with rabbit 
polyclonal antibodies raised against recombinant Ad37 fiber or Ad5 fiber or 
penton base proteins expressed in baculovirus-infected cells. 
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Particles with Ad37 fiber 

Cells from the Ad37 fiber producing cell line 705 were infected at 
approximately 1000 particles/cell with Ad5.£gal.AF or with Ad5.<3FP.AF. Viral 
particles were isolated and purified over CsCl gradients*. The bands were 
5 harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1 , 0.9% NaCI, and 
10% glycerol), aliquoted and stored at-70°C. 
Viral protein analyses 

For analysis of viral proteins, 10 jug of purified Ad5.£gal.AF particles with 
no fiber (grown in 293 cells), the Ad5 fiber (grown in 633 cells), or the Ad37 
10 fiber (grown in 705 cells) were electrophoresed by 8-1t>% polyacrylamide 

gradient SDS-PAGE and the proteins were transferred to nylon membranes. The 
blot was then probed with a-Ad37 fiber rabbit antibody. Ad5 fiber and penton 
base were detected by reprobing the blot with polyclonal antibodies raised 
against recombinant proteins expressed in baculovirus-infected cells (Wickman et 
15 al. (1993) Cell 2:309-3 19). 

Adenovirus infection and cell binding assays 

Adherent Chang C and A549 cells were infected with <3FP expressing 
Ad5 vectors containing the Ad5 fiber (Ad5.X5FP.AF/5F) or the Ad37 fiber 
(Ad5.GFP.AF/37F) at 10,000 particles per cell for 3 hours at 37 *C, 5%<X> 2 in 

20 DMEM, 10% FCS. Cells were washed twice with saline and then cultured 

overnight at 37° C, 5% C0 2 . The next day, the cells were detached with buffer 
containing 0.05% (w/v) trypsin and 0.5 mM CDTA (Boehringer Mannheim) for 5 
minutes at 37 °C. Suspended cells were washed once with PBS and then 
resuspended in phosphate-buffered saline (PBS), pH 7.4. GFP fluorescence was 

25 measured with a FACScan flow cytometer. A threshold established by the 
fluorescence of uninfected cells was used to distinguish cells expressing <3FP. 
To assess the role of CAR in Ad infection, 1t),000 attached cells were pre- 
incubated with 1 80 //g/ml RmcB, a function-blocking anti-CAR monoclonal 
antibody (Hsu eta/. (1988) J.Virol. 62:1647-1652), in complete DMEM for 1 

30 hour at 4*C. A small volume containing Ad5.GFP.AF/5F or Ad5.GFP.AF/37F 
was then added at 10,000 particles per cell. The cells were infected for 3 
hours, cultured overnight, harvested, and analyzed for GFP expression. Percent 
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ceils expressing GFP was determined by the percent of cells detected above a 
threshold set by the fluorescence of uninfected Chang C cells. 

To measure adenovirus binding to cells, wild type Ad37 was labeled with 
125 l using lodogen (Pierce) according to manufacturer instructions and separated 
5 from free 125 l by gel filtration as described (Huang et al. (1999) J. Virol. 

75:2798-2802). Binding of radiolabeled wild type Ad37 on Chang C cells was 
then quantitated as described (Huang eta/. (1999) J. Virol. 73:2798-2802). 
Non-specific binding was determined by incubating cells and labeled Ad37 
particles in the presence of 100-fold concentration of unlabeled Ad37. Specific 
10 binding was calculated by subtracting the non-specific binding from the total 
cpm bound. To examine if divalent cations are required for binding, 10 mM 
ethylenediaminetetraacetic acid (EDTA) or various concentrations of CaCI 2 , or 
MgCI 2 were added to cells before incubation with labeled virus. To examine if 
the receptor for Ad37 is a protein, cells were pretreated with 10 //g/ml trypsin 
15 (GIBCO), subtilisin (Sigma), proteinase K (Boehringer-Mannheim), and bromelain 
(Sigman) at 37 °C for 1 hour, then washed twice with complete DMEM before 
adding labeled virus. Cells were >95% viable after protease treatment. 

Ad37 binding to conjunctival cells is calcium-dependent. Specific 12S I- 
labeled Ad37 binding to Chang C cells was measured in the presence of 10 mM 
20 EDTA and in the presence of varying concentrations of calcium chloride or 
magnesium chloride. Specific binding was determined by subtracting the 
nonspecific counts in the presence of 100-fold excess unlabeled virus from the 
total counts. 

Pretreatment of conjunctival cells with proteases inhibits Ad37 binding. 

25 Change C cells were pretreated with various proteases for 1 hour before binding 
125 l-labeled Ad37 to the cells. Nonspecific binding was measured by adding 
100-fold unlabeled Ad37 to cells with 125 l-labeled Ad37 and subtracting from 
total counts for specific binding. Percent inhibition represents the difference in 
specific binding of untreated cells and pretreated cells as a percentage of the 

30 specific binding of untreated cells. 
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Virus overlay protein blot assay (VOPBA) 

For VOPBA of human conjunctival membrane proteins probed with Ad37 
in the presence of EDTA or calcium chloride, Chang C membrane fractions were 
separated by 8% SDS-PAGE and transferred to a PDVF membrane. The 
5 membrane was subsequently probed with or without whole Ad37 particles, a 
polyclonal antibody against Ad37 fiber, and finally a horseradish peroxidase 
conjugated anti-rabbit antibody, in the presence of 1EDTA or calcium chloride. 
Transferred Chang C membrane proteins were probed with recombinant Ad37 
knob protein, instead of Ad37 knob, in the presence of calcium chloride. 

10 Confluent monolayers of Chang C and A549 cells were detached by 

scraping, pelleted by centrifugation, and then resuspended in 250 mM sucrose, 
20 mM HEPES, pH 7.0, 1 mM EDTA, and 2/yg/ml aprotinin and teupeptin. Cells 
were transferred into a dounce homogenizer and disrupted with 30 strokes. 
Organelles and nuclei were pelleted atSOOg for 15 min. Plasma membrane 

15 fragments were then pelleted from the supernatant of cell lysates at*200,000g 
for 1 hour and then resuspended in 10 mM Tris*CI, pH 8.1, 10//g/ml aprotinin 
and leupeptin. 

Cell membranes of Chang C or A549 cells were incubated (1 :1) with a 
2% SDS, non-reducing buffer and separated on an 8% polyacrylamkJe gel 

20 without boiling. Membrane proteins were then electroblotted onto a PVOF 
membrane (Immobilon-P) and blocked in 5% (w/v) milk in PBS, pH 7.4,0.02% 
Tween-20 (PBS-T). After blocking, the membrane was incubated with 1 //g/ml 
wild-type Ad19p or Ad37 in 0.5% (w/v) milk in PBS-T, 1 mMCaCI 2 , for 1 hour 
at room temperature. The membrane was then washed once with phosphate- 

25 buffered saline, pH 7.4 (PBS), 1 mM CaCI 2 , and incubated with 11500 dilution of 
a-Ad37 fiber rabbit antibody in 0.5% (w/v) milk in PBS-T, 1mM CaCI 2 , for 30 
minutes at room temperature. The membrane was washed again with *PBS, 1 
mM CaCI 2 , and incubated with 1 :5000 dilution of horseradish peroxidase (HRP) 
conjugated a-rabbit antibody (Sigma) in 0.5% (w/v) milk in PBS-T, 1 mM CaCI 2 , 

30 for 30 minutes at room temperature. The membrane was washed four times in 
PBS, 1 mM CaCI 2 , once with PBS-T, 1 mM CaCI 2 , and once in 1 mM CaCI 2 . The 
blot was developed with enhanced chemiluminescence reagents (Pierce) for 15 
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minutes and placed onto a piece of Biomax film (Kodak) for 5 seconds to 1 

minute. For divalent metal cation experiments, membranes ware incubated in 

the presence of 2 mM EDTA instead of 1 mM CaCI 2 in all solutions. To assay 

fiber knob binding to cell membrane proteins, membrane filters were incubated 

5 with 1 //g/ml purified T7-tagged Ad37 knob protein in Tris-buffered saline, 0.1% 

Tween-20, 1 mM CaC1 2 , for 1 hour at room temperature. a-Ad37 fiber rabbit 

antibody and HRP-conjugated anti-rabbit antibody were applied and the 

membrane was developed with substrate solution as described above. 

Results: Comparison of adenovirus infection of human conjunctival and liing 
10 epithelial cells with virus particles retargeted with AdS or Ad37 fiber proteins 

Packaging cell lines producing the Ad37 fiber protein were generated. 

Since the N-terminal amino acid sequences of the Ad5 and Ad37 fiber proteins 

differ significantly, and to ensure that the Ad37 fiber would be efficiently 

incorporated into AdS vector particles, several residues in the wild-type Ad37 

15 fiber were mutated to more closely match the Ad5 sequence. Stable cell lines 

producing this fiber under control of the CMV promoter and the adenovirus type 

5 tripartite leader were then generated and screened for fiber expression by 

indirect immunofluorescence. One clone (line 70S), which expressed the Ad37 

fiber at a high level, was selected for further study. 

20 Cells from one cell line 633, which expresses the wild-type AdS fiber 

protein, and line 705 were infected with a fiber-deleted AdS vector carrying a 

#galactosidase reporter gene. The resulting vector particles contained the AdS 

fiber protein (Ad5.£gal.AF/5F) and the Ad37 fiber protein (Ad5.j9gal.AF/37F), 

respectively. Incorporation of the correct fiber protein into viral particles was 

25 verified by Western blotting. Adenoviral vectors containing the GFP reporter 

gene, Ad5.GFP.AF/5F and Ad5.GFP.AF/37F, were created in the same fashion. 

Infection of a variety of cell types using the retargeted adenovirus 

particles was examined. As assayed by GFP fluorescence, Ad5.<3FP.AF/5F 

exhibited good gene delivery to lung epithelial {A549) and conjunctival celis 

30 (Chang C). In contrast, Ad5.GFP.AF/37F efficiently delivered GFP to -Chang C 

cells, but exhibited very poor gene delivery to A549 cells.' Although CAfi is 

expressed on the surface of A549 cells, as indicated by AD5.<3FP.AF/5F 
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infection, Ad5.GFP.AF/37F was unable to infect these cells efficiently. This 
experiment shows that the Ad37 fiber protein can confer preferential infection of 
human conjunctival cells, but not CAR-expressing human lung epithelial cells. 

Hence CAR is not the primary receptor for Ad37. Recent studies reported 
5 that expression of CAR on the surface of Chinese hamster ovary (CHO) cells did 
not improve Ad37 binding (Arnberg et al. (2000) J. Virol. 74:42-48), implying 
that Ad37 does not use CAR as a primary receptor. In order to verify this on 
human conjunctival cells, A549 and Chang C cells were pretreated with RmcB 
(Hsu et al. (1988) J. Virol 52:1647-1652), a function-blocking monoclonal 

10 antibody against CAR. The RmcB antibody inhibited infection of A549<;ellsby 
Ad5.GFP.AF/5F, but it had little effect on infection of Chang C ceUsby 
Ad5.GFP.AF/37F. This indicates that CAR is not the primary receptor for Ad37 
on Chang C conjunctival cells. 

Ad37 binding to conjunctival cells requires divalent metal cations. It has 

15 been proposed (Roelvink et ah (1998) J.Virol. 72:7909-7915) that a combination 
of fiber binding to CAR and penton base binding to a v -integrins allows some 
adenovirus serotypes to attach to cells. Although a v -integrin binding to the RGD 
motif of the adenovirus penton base is of relatively low affinity {Wickman et al. 
(1993) Cell 2:309-31 9), it may nonetheless contribute to viral attachment to the 

20 cell surface. Ad37 shows a particularly strong affinity for binding to integrin cg? 5 
(Mathias et al. (1 998) J.Virol. 72:8669-8675), suggesting that integrin a^ s 
might be a primary receptor for Ad37. Binding of the RGD motif by a v -integrins 
requires the presence of divalent cations, such as calcium or magnesium (Stuiyer 
etal. (1996) J. Cell Physiol. 755:521-531). In contrast, no divalent cations were 

25 required for binding in the CAR-Ad12 knob complex (Bewley et al. (1999) 
Science 286: 1 579-1 583). 

To investigate the potential role of a v -integrins and divalent metal cations 
in Ad37 receptor binding, 125 MabeIed Ad37 binding to Chang C cells was 
examined in the absence or presence of EDTA. EDTA inhibited Ad37 binding to 

30 conjunctival cells but did not alter Ad5 binding. These findings suggest a 
requirement for divalent metals for Ad37 binding. 
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The presence of either calcium or magnesium ions helps aj? 5 organize in focal 
contacts (Stuiver et at. (1 996) J.Cell Physiol. 755:521-531), suggesting that 
calcium and magnesium aid in integrin aj9 5 function. To further test the 
potential role of integrin aJ3± in Ad37 cell attachment, 125 l-laheJed.Ad37 binding 
5 to Chang C cells was measured in the presence of varying concentrations of 
calcium or magnesium chloride. Magnesium ions had little effect on Ad37 
binding to Chang C cells. In contrast, calcium ions dramatically enhanced Ad37 
binding to Chang C cells. The optimal concentration of calcium chloride for 
Ad37 binding was 1 mM, while higher concentrations of calcium actually 
decreased virus binding to cells. The fact that calcium, but not magnesium, 
promoted Ad37 attachment is not consistent with integrin oJJq as the primary 
receptor for viral attachment to the cells since either metal will support ligand 
binding to integrin aj? 5 . Moreover, A549 cells express abundant cr v -integrins 
(Mathias eta/. (1998) J.Virol. 72:8669-8675) but were unable to support 
efficient binding of Ad37. 

Wild-type Ad37 particles bind to three conjunctival membrane proteins. 
Recent studies reported that protease treatment of CHO cells abolished Ad37 
binding (Arnberg et a/. (2000) J. Virol. 74:42-48), implying that Ad37 bound to 
a protein receptor on CHO cells. Scatchard analysis of Ad37 binding to<:hang C 
cells showed that each cell expresses approximately 24,000 fiber binding ^ites 
(Huang etal. (1999) J. Virol. 73:2798-2802). To determine if the Ad37 binding 
site on human conjunctival cells is also a protein, Chang C cells were treated 
with different proteases prior to measuring binding of 12S l-labeled Ad37. 
Digestion of surface proteins by all four proteases inhibited Ad37 binding to 
Chang C cells by greater than 50%. This finding showed that Ad37 also binds 
to a protein receptor on Chang C cells. 

Virus overlay protein blot assays (VOPBAs) were used to identify 
candidate viral protein receptors. This Western blot technique uses intact viral 
particles in place of antibodies to probe viral-receptors interactions. VOPBA was 
used herein to identify Chang C membrane proteins that bind to Ad37. In the 
absence of Ad37 particles, no protein bands were observed, while addition of 
virus in the absence of calcium revealed binding to a single 45 kOa protein. In 
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the presence of 1 mM calcium chloride, Ad37 reacted with three proteins with 
approximate molecular weights of 45, 50 and 60 kDa. The^same three proteins 
were detected using a recombinant Ad37 fiber knob alone, indicating that Ad37- 
receptor interactions are fiber mediated and do not involve, interactions of other 
5 capsid proteins such as the penton base. The size of the calcium-independent 
protein (45 kDa) is very similar to the known molecular weight of CAR. A direct 
comparison of the Ad37 VOPBA and a CAR Western blot showed that the 45 
kDa receptor co-migrates with CAR on SDS-PAGE. Moreover, two other 
members of subgroup D adenoviruses, Ad9 and AD15, have been shown to bind 

10 to CAR (Roelvink etal. (1998) J.Virol. 72:7909-7915). 

Since CAR does not appear to mediate Ad37 binding on intact Chang C 
cells, the possibility that the 50 or 60 kDa protein serves this function was 
tested by examining an adenovirus serotype that does not bind to Chang C cells. 
Ad19p, a closely related subgroup D adenovirus, binds poorly to Chang C cells 

15 (Huang etal. (1999) J. Virol. 73:2798-2802) and Ad19p recognition of the 
Ad37 receptor is therefore unlikely. Ad19p particles bound to the 45 and 60 
kDa receptors in the VOPBA, but did not bind to the 50 kDa receptor. 
Moreover, the 50 kDa receptor is expressed on Chang C cells, but not A549 
cells, which only support low levels of Ad37 binding and infection. Taken 

20 together, these data indicate that the 50 kDa protein is a primary -candidate 
receptor for Ad37 on human conjunctival cells. 
Discussion 

The identification of the CAR protein as a major adenovirus receptor does 
not explain why certain subgroup D members, such as Ad37, preferentially 

25 infect ocular cells. A 50 kDa human conjunctival cell membrane protein is 

identified herein as a primary candidate for the receptor for Ad37. This 50 kDa 
protein is not present on A549 lung epithelial cells. Ad37 binding to this receptor 
is calcium-dependent, which is consistent with Ad37 binding and infection 
experiments. Ad37 also bound to a 60 kDa protein that is present on human 

30 conjunctival and lung epithelial cells. It does not, however, appear to be 

serotype specific. The molecular weights of MHC class I heavy chain, which has 
been proposed as a receptor for AdS, and aj} 2 and ajt* intergrins, receptors for 
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the penton base, are distinct from the 50 or to kDa receptor characterized in this 
study. 

The studies of Ad37-receptor interaction using VOPBAs are consistent 
with, previous studies showing that subgroup D adenoviruses. «can bind to the 
extracellular domain of CAR (Roelvink et al. (1998) J.Virol. 72:7909-7915). 
Biochemical and structural studies on knob-CAR interactions indicate that the 
CAR binding site is located on the AB-loop of the fiber knob. Alignment of the 
fiber sequences of Ad37 and other adenoviruses reveals that the AB-loop of 
Ad37 is similar to those of Ad 1 2 and Ad5. Moreover, a phylogenetic tree of 
adenovirus knobs (Roelvink eta/. (1998) J.ViroL 72:7909-7915) shows that 
fiber proteins of subgroup D are similar to those of subgroup C and E, which use 
CAR as their primary receptor. Ad37 does not, however, appear to effectively 
use CAR as a primary receptor, as demonstrated by virus binding and infection 
studies on Chang C conjunctival cells and A549 lung epithelial cells; 

It has been reported that Ad37 uses sialic acid as a receptor on Chinese 
hamster ovary (CHO) cells and human lung carcinoma (AS49) cells {Arnberg et 
al. ((2000) J. Virol. 74:42-48). Human conjunctival cells were not studied. 
Human corneal epithelial (HCE) cells were the only ocular cell line studied and 
Ad37 binds relatively poorly to these cells, compared to binding on AS49 cells 
(Arnberg et al. ((2000) J. Virol. 74:42-48). In addition, 8,4 X 10 7 wheat germ 
agglutinin molecules per cell were required to significantly inhibit Ad37 binding 
to sialic acid on sialic acid positive CHO cells (Amberg et al. (2000) J. Virol. 
74:42-48), three orders of magnitude higher than the number of Ad37 receptors 
on Chang C conjunctival cells (Huang et al. (1999) J. Virol. 73:2798-2802). 
Clearly, sialic acid is not the only factor responsible for Ad37 binding to the cell 
surface and its influence on Ad37 tropism is unclear. 

The results herein show that Ad37 selects a 50 kDa cellular receptor for 
binding to conjunctival cells, but it is possible that sialic acid also plays a role in 
this interaction. The characterization and identification of the Ad37 receptor 
have therapeutic implications and also explain the different tropism of Ad37. 
The 50 kDA receptor for Ad37 may also be the receptor for other subgroup D 
adenoviruses that cause severe cases of EKC, Ad19a and Ad8. Ad19p is a 
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nonpathogenic variant of Ad19 (Arnberg et at. (1998) Virology 227:239-244) 
while Ad19A, along with Ad8 and Ad37, are major causes of EKC. Ad19a and 
Ad37 have identical fiber proteins (Arnberg eta/. (1998) Virology 227:239-244) 
and have similar tropism in vivo. Ad8, Ad19a, and Ad37 agglutinate dog and 
5 guinea pig erythrocytes more effectively than four other serotypes that are 
associated with less severe forms of conjunctivitis (Arnberg et al. (1998) 
Virology 227:239-244), implying that the receptors of Ad18, Ad19A, and Ad37 
have similar characteristics. Hence, this 50 kDa receptor is an attractive drug 
target against EKC caused by adenoviruses to provide therapeutic intervention of 

10 ocular diseases associated with these viruses. 

EXAMPLE 10 
Targeting of the Ad5 vector to photoreceptor cells 

The fiber-deleted adenovirus vector Ad5.GFP.AF was propagated in 705 
cells, which express a modified Ad37 fiber protein. Viral particles 

15 (Ad5.GFP.Af/37F) were harvested, CsCI-purified and dialized into 0.9% NaCI, 10 
mM Tris, pH 8.1 , and 10% glycerol. Two to three //I of the resulting solution, 
containing approximately 1 x 1 0 9 particles/^ was injected into the vitreous 
chamber of a mouse eye. Seven days post-injection, eyes were harvested, fixed 
with paraformaldehyde and cryo-sectioned. Sections were stained with an anti- 

20 rhodopsin antibody to identify photoreceptor cells and with DAPI to show all cell 
nuclei. The resulting sections showed red anti-rhodopsin staining in the 
photoreceptors, blue DAPI-stained nuclei, and green X3FP staining in any 
transduced cells. The results revealed substantially exclusive transduction of 
photoreceptors. Co-localization of rhodopsin staining and GFP expression 

25 indicated selective transduction of photoreceptor cells. 

As a control, contralateral eyes were injected with a stock of the fiber- 
deleted vector AD5./?gal.AF grown in the same Ad37 fiber-expressing cells. 
Since this virus (Ad5./?gal.AF/37F) produces /?gal rather than GFP, the green 
staining is absent from the photoreceptors. 

30 Additional experiments using the AD37 fiber for targeting to the 

photoreceptor cells have been performed. Subretinal and intravitreal injection 
have been used in mouse models and the results demonstrate targeting to the 
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photoreceptors. As with intravitreally injected eyes, the major cell type infectd 
via subretinal administration was the photoreceptor. 

As noted, Ad5.GFP.AF /37F infected Chang C cells efficiently, but A649 
cells poorly. Ad37 fiber protein confers preferential infection on human 
5 conjunctival cells, but not CAR-expressing human lung epithelial cells. Binding 
to conjunctival cells requires divalent cations. 



Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule, comprising: 

adenovirus inverted terminal repeat sequences; an adenovirus packaging 

signal operatively linked thereto; and a photoreceptor-specif ic promoter. 
5 2. The isolated nucleic acid molecule of claim 1, further comprising a 

nucleic acid encoding a therapeutic product operatively linked to the promoter. 

3. . The isolated nucleic acid molecule of claim 1, wherein the 
promoter is a rhodopsin promoter. 

4. The nucleic acid molecule of claim 1, wherein the adenovirus 
10 genome does not encode a functional fiber protein such that packaging the 

nucleic acid requires complementation in a packaging cell. 

5. A recombinant adenovirus vector, comprising the nucleic acid 
molecule of any of claims 1 -4 packaged therein. 

6. A recombinant adenovirus vector of claim 5, wherein inverted 
1 5 terminal repeat sequences (ITR) and a packaging signal are derived from 

adenovirus type 2 or adenovirus type 5. 

7. A recombinant adenovirus vector of claim 5, wherein the virus 
comprises a fiber protein. 

8. A recombinant adenovirus vector of claim 7, wherein the fiber 
20 protein selectively binds to photoreceptors in the eye of a mammal. 

9. A recombinant adenovirus vector of claim 7, wherein the fiber is a 
chimera composed of N-terminal sequences from adenovirus type 2 or type 5, 
and a sufficient portion of an adenovirus serotype D fiber for selective binding to 
photoreceptors in the eye of a mammal. . 

25 10. A method for targeted delivery of a gene product to the eye of a 

mammal, comprising: 

administering a recombinant adenovirus virus that comprises heterologous 
DNA encoding the gene product or resulting in expression of the gene product, 
wherein the recombinant virus comprises a fiber protein that specifically or 

30 selectively binds to receptors that are expressed on cells in the eye. 
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1 1 . The method of claim 10, wherein the cells are photoreceptors. 

12. The method of claim 10, wherein administration is effected by 
intraocular delivery. 

13. The method of claim 10, wherein administration is effected by a 

5 method selected from subretinal injection, intravenous administration, periorbital 
administration, and intravitreal administration. 

14. The method of claim 10, wherein the recombinant virus comprises 
a fiber protein from an adenovirus type D serotype. 

15. The method of any of claims 10-14, wherein the fiber protein is an 
10 adenovirus type 37. 

16. The method of any of claims 10-14, wherein the fiber is a chimeric 
protein containing a sufficient portion of the N-terminus of an adenovirus type 2 
or type 5 fiber protein for interaction with an adenovirus type 2 or type 5 
penton, and a sufficient portion of an adenovirus serotype D knob portion of the 

15 fiber for selective binding to photoreceptors in the eye of a mammal. 

17. The method of any of claims 10-16, wherein the recombinant virus 
is an adenovirus type D serotype. 

18. The method of any of claims 10-17, wherein the encapsulated 
nucleic acid comprises a photoreceptor-specific promoter operativeiy linked to a 

20 nucleic acid comprising the therapeutic product. 

19. The method of claim 18, wherein the therapeutic product is 
selected from the group consisting of a trophic factor, an anti-apoptotic factor, a 
gene encoding a rhodopsin protein, a wild-type Stargardt disease ^gene (STDG1), 
an anti-cancer agent and a protein that regulates expression of a photoreceptor- 

25 specific gene product. 

20. The method of any of claims 10-19, wherein delivery is effected 
for treatment of an ocular disease. 

21. The method of claim 20, wherein the disorder is a retinal 
degenerative disease. 

30 22. The method of claim 20, wherein the disease is retinitis 

pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, or 
retinoblastoma. 
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23. The method of any of claims 10-22, wherein the mammal is a 
human. 

24. The method of any of claims 10-22, wherein the viral nucleic acid 
comprises: 

an adenovirus inverted terminal repeat (ITR) sequences; and 
an adenovirus packaging signal operatively linked thereto. 

25. The method of claim 24, wherein the ITRs and packaging -signal 
are derived from an adenovirus serotype B or C. 

26. The method of claim 24, wherein the ITRs and packaging signal 
are derived from an adenovirus type 2 or 5. 

27. The method of claim 24, wherein the viral nucleic acid further 
comprises a photoreceptor-specific promoter. 

28. A method of targeted gene therapy, comprising: 
administering a recombinant viral vector that comprises an adenovirus 

type 37 fiber protein or portion thereof, whereby the vector selectively 
transduces photoreceptors and delivers a gene product encoded by the 
recombinant viral vector; wherein the portion is sufficient for selective binding to 
photoreceptors. 

29. The method of claim 26, wherein the vector is administered into 
the eye. 

30. The method of claim 28, wherein the vector is administered to the 
vitreous cavity of the eye. 

31 . The method of claim 28, wherein administration is effected by 
subretinal injection, intravenous administration, periorbital administration or 
intra vitreal administration. 

32. The method of any of claims 10-31, wherein at least about 10 7 
plaque forming units of virus are administered. 

33. The method of any of claims 10-31, wherein about 1 plaque 
forming unit to about 1 0 14 plaque forming units of virus are administered. 
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SEQUENCE LISTING 



<110> VON SEGGERN, DANIEL 
NEMEROW, GLEN R. 
FRIEDXiANDER , MARTIN 

<120> VECTORS FOR OCULAR TRANSDUCTION AND USE THEREFOR FOR 43ENETIC THERAPY 

<130> 756.1PCT/NOV0205P 

<140> 

<141> 2001-05-01 

<150> 09/562,934 
<151> 2000-05-01 

<160> 51 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 1 

cggtacacag aattcaggag acacaactcc 30 



<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
<400> 3 

cgcggatccg eg 12 



<210> 4 
<211> 8710 
<212> DNA 

<213> Artificial Sequence 



<400> 2 

gcctggatcc gggaagttac gtaacgtggg aaaac 



35 



<220> 
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<223> Description of Artificial Sequence: plasmid 
<400> 4 

cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag 60 
ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac 120 
cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga 180 
ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc 240 
accctaatca agttttttgg ggt.cgagg.tg ccgtaaagca ctaaatcgga accctaaagg 300 
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa 360 
gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggfccacgc tgcgcgtaac 420 
caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct 480 
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa S40 
agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 600 
ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg €60 
gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcagga gacacaactc 720 
caagtgcata ctctatgtca ttttcatggg actggtctgg ccacaactac attaatgaaa 780 
tatttgccac atcctcttac actttttcat acattgccca agaataaaga atcgtttgtg 840 
ttatgtttca acgtgtttat ttttcaattg cagaaaattt -caagtcattt ttcattcagt 900 
agtatagccc caccaccaca tagcttatac agatcaccgt accttaatca aactcacaga 960 
accctagtat tcaacctgcc acctccctcc caacacacag agtacacagt cctttctccc 1020 
cggctggcct taaaaagcat catatcatgg gtaacagaca tattcttagg tgttatattc 1080 
cacacggttt cctgtcgagc caaacgctca tcagtgatat -taataaactc cccgggcagc 1140 
tcacttaagt tcatgtcgct gtccagctgc tgagccacag gctgctgtoc aacttgcggt 1200 
tgcttaacgg gcggcgaagg agaagtccac gcctacatgg gggtagagtc ataatcgtgc 1260 
atcaggatag ggcggtggtg ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc 1320 
gtcctgcagg aatacaacat ggcagtggtc tcctcagcga tgattcgcac cgcocgcagc 1380 
ataaggcgcc ttgtcctccg ggcacagcag cgcaccctga tctcacttaa atcagcacag 1440 
taactgcagc acagcaccac aatattgttc aaaatcccac agtgcaaggc gctgtatcca 1S0O 
aagctcatgg cggggaccac agaacccacg tggccatcat accacaagcg -caggtagatt 1560 
aagtggcgac ccctcataaa cacgctggac ataaacatta cctcttttgg ^catgttgtaa 1620 
ttcaccacct cccggtacca tataaacctc tgattaaaca tggcgccatc -caccaccatc 1680 
ctaaaccagc tggccaaaac ctgcccgccg gctatacact gcagggaacc gggactggaa 1740 
caatgacagt ggagagccca ggactcgtaa ccatggatca tcatgctcgt catgafcatca 1800 
atgttggcac aacacaggca cacgtgcata cacttcctca ggattacaag <:tcctcccgc 1860 
gttagaacca tatcccaggg aacaacccat tcctgaatca gcgtaaatcc -cacactgcag 1920 
ggaagacctc gcacgtaact cacgttgtgc attgtcaaag tgttacattc gggcagcagc 1980 
ggatgatcct ccagtatggt agcgcgggtt tctgtctcaa aaggaggtag acgafccccta 2040 
ctgtacggag tgcgccgaga caaccgagat cgtgttggtc gtagtgtcat gccaaatgga 2100 
acgccggacg tagtcatatt tcctgaagca aaaccaggtg cgggcgtgac aaacagatct 2r60 
gcgtctccgg tctcgccgct tagatcgctc tgtgtagtag ttgtagtata tccactctct 2220 
caaagcatcc aggcgccccc tggcttcggg ttctatgtaa actccttcat gcgccgctgc 2280 
cctgataaca tccaccaccg cagaataagc cacacccagc xzaacctacac attcgttctg 2340 
cgagtcacac acgggaggag cgggaagagc tggaagaacc atgttttttt ttttattcca 2400 
aaagattatc caaaacctca aaatgaagat ctattaagtg aacgcgctcc cctocggtgg 24€0 
cgtggtcaaa ctctacagcc aaagaacaga taatggcatt tgtaagatgt tgcacaatgg 2520 
cttccaaaag gcaaacggcc ctcacgtcca agtggacgta aaggctaaac ccttcagggt 2580 
gaatctcctc tataaacatt ccagcacctt caaccatgcc -caaataattc tcatctcgec 2640 
accttctcaa tatatctcta agcaaatccc gaatattaag tccggccatt gtaaaaatct 2700 
gctccagagc gccctccacc ttcagcctca agcagcgaat catgat^gca aaaafctcagg 2T60 
ttcctcacag acctgtataa gattcaaaag cggaacatta acaaaaatac cgcgafccccg 2820 
taggtccctt cgcagggcca gctgaacata atcgtgcagg tctgcacgga -ccagcgcggc 2880 
cacttccccg ccaggaacct tgacaaaaga acccacactg attatgacac gcatacfccgg 2940 
agctatgcta accagcgtag ccccgatgta agctttgttg catgggcggc gatataaaat 3000 
gcaaggtgct gctcaaaaaa tcaggcaaag cctcgcgcaa aaaagaaagc acatcgtagt 3060 
catgctcatg cagataaagg caggtaagct ccggaaccac cacagaaaaa gacaccattt 3120 
ttctctcaaa catgtctgcg ggtttctgca taaacacaaa ataaaataac aaaaaaacat 3180 
ttaaacatta gaagcctgtc ttacaacagg aaaaacaacc cttataagca taagacggac 3240 
tacggccatg ccggcgtgac cgtaaaaaaa ctggfccaccg tgattaaaaa gcaccaccga 3300 
cagctcctcg gtcatgtccg gagtcataat gtaagactog gtaaacacat caggtfccfatt 33-60 
catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc cgcaggcgta 3420 
gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag aaaaacacat 3480 
aaacacctga aaaaccctcc tgcctaggca aaatagcacc <:fccccgctcc agaacaacat 3540 
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acagcgcttc acagcggcag cctaacagtc agccttacca gtaaaaaaga aaacctatta 3600 
aaaaaacacc actcgacacg gcaccagctc aatcagtcac agtgtaaaaa agggecaagt 3660 
gcagagcgag tatatatagg actaaaaaat gacgtaacgg ttaaagtcca caaaaaacac 3720 
ccagaaaacc gcacgcgaac ctacgcccag aaacgaaagc caaaaaaccc acaacttcct 3780 
caaatcgtca cttccgtttt cccacgttac gtaacttccc ggatccgcgg cattcacagt 3840 
tctccgcaag aattgattgg ctccaattct tggagtggtg aatccgttag cgaggtgccg 3900 
ccggcttcca ttcaggtcga ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg 3960 
cagacaaggt atagggcggc gcctacaatc catgccaacc cgttccatgt gctcgccgag 4020 
gcggcataaa tcgccgtgac gatcagcggt ccagtgatcg aagttaggct ggtaagagcc 4080 
gcgagcgatc cttgaagctg tccctgatgg tcgtcatcta cctgcctgga cagcatggcc 4140 
tgcaacgcgg gcatcccgat gccgccggaa gcgagaagaa tcataatggg gaaggecatc 4200 
cagcctcgcg tcgcgaacgc cagcaagacg tagcccagcg cgtcggccgc catgccctgc 4260 
ttcatccccg tggccpcgttg ctcgcgtttg ctggcggtgt ccccggaaga aatatatttg 4320 
catgtcttta gttctatgat gacacaaacc ccgcccagcg tcttgt*:att ggcgaattcg 4380 
aacacgcaga tgcagtcggg gcggcgcggt cccaggtcca cttcgcatat taaggtgacg 4440 
cgtgtggcct cgaacaccga gcgaccctgc agcgacccgc ttaacagcgt -caacagcgtg 4500 
ccgcagatcc cgggcaatga gatatgaaaa agcctgaact caccgcgacg t<:tgtcgaga 4560 
agtttctgat cgaaaagttc gacagcgfcct ccgacctgat gcagctctcg gagggogaag 4620 
aatctcgtgc tttcagcttc gatgtaggag ggcgtggata tgtcctgcgg gtaaatagct 4680 
gcgccgatgg tttctacaaa gatcgttatg tttatcggca ctttgcatcg gccgcgctcc 4740 
cgattccgga agtgcttgac attggggaat tcagcgagag -cctgacctat tgcatctecc 4800 
gccgtgcaca gggtgtcacg ttgcaagacc tgcctgaaac cgaactgccc gctgttctgc 4860 
agccggtcgc ggaggccatg gatgcgatcg ctgcggccga tcttagccag acgagcgggt 4920 
tcggcccatt cggaccgcaa ggaatcggtc aatacactac atggcgtgat ttcatatgcg 4980 
cgattgctga tccccatgtg tatcactggc aaactgtgat ggacgacacc gtcagtgcgt 5040 
ccgtcgcgca ggctctcgat gagctgatgc tttgggccga ggactgcccc gaagtccggc 5100 
acctcgtgca cgcggatttc ggctccaaca atgtcctgac ggacaatggc <:gcataacag 5160 
cggtcattga ctggagcgag gcgatgttcg gggattccca atacgaggtc gecaacatct 5220 
tcttctggag gccgtggttg gcttgtatgg agcagcagac gcgctacttc gagcggaggc 5280 
atccggagct tgcaggatcg ccgcggctcc gggcgtatat gctccgcatt ggtcfctgacc 5340 
aactctatca gagcttggtt gacggcaatt tcgatgatgc agcttgggcg cagggt-cgat 5400 
gcgacgcaat cgtccgatcc ggagccggga ctgtcgggcg tacacaaatc gcccgcagaa 5460 
gcgcggccgt ctggaccgat ggctgtgtag aagtactcgc -cgatagtgga aaccgacgcc 5520 
ccagcactcg tccgagggca aaggaatagg ggagatgggg gaggctaact gaaacacgga 5580 
aggagacaat accggaagga acccgcgcta tgacggcaat aaaaagacag aataaaacgc 5640 
acgggtgttg ggtcgtttgt tcataaacgc ggggttcggt cccagggctg gcactctgtc 5700 
gataccccac cgagacccca ttggggccaa tacgcccgcg tttcttcctt ttccccaccc 5760 
caccccccaa gttcgggtga aggcocaggg ctcgcagcca acgtcggggc ggcaggecct 5820 
gccatagcca ctggccccgt gggttaggga cggggtcccc catggggaat ggtttatggt 5880 
tcgtgggggt tattattttg ggcgttgcgt ggggtcfcggt ccacgactgg actgagcaga 5940 
cagacccatg gtttttggat ggcctgggca tggaccgcat gtactggcgc gacacgaaca €000 
ccgggcgtct gtggctgcca aacacccccg acccccaaaa accaccgcgc ggatttctgg €060 
cgcccagtgc cgtcgaccgg tcatggctgc gccccgacac ccgccaacac ccgctgacgc -6120 
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga -ccgtctccgg 6180 
gagctgcatg tgtcagaggt tttcaccgtc at<:accgaaa cgcgcgaggc agecggatca €240 
taatcagcca taccacattt gtagaggttt tacttgcttt aaaaaacctc cccacctccc 6300 
cctgaacctg aaacataaaa tgaatgcaat tgttgttgtt aacttgttta ttgcagcfcta 6360 
taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact 6420 
gcattctagt tgtggtttgt ccaaactcat caatgtatct tafccatgtct ggatccacta 6480 
gttctagagc ggccgccacc gcggtggagc tccagctttt gttcccttta gtgagggtta €540 
atttcgagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaafctg ttatccgctc 6600 
acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga €€60 
gtgagctaac tcacattaat tgcgttgcgc tcactgpccg ctttccagtc gggaaacctg €720 
tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggtt* gcgtattggg €780 
cgctcttccg cttcctcgct cactgactcg ctgcgct-cgg tcgttcggct gcggcgagcg €640 
gtatcagctc actcaaaggc ggtaatacgg £tatccacag aatcagggga taacgcagga €900 
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg €9€0 
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagfccag 7020 
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg. aagctccctc 7080 
gtgcgctctc ctgttccgac cctgccgctt accggatacc -tgtccgcctt fcctcccfctcg 7140 
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tx:agttcggt gtaggtcgtt 7200 
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg ^gccttatcc 72€0 
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ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 7320 
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 7380 
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 7440 
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 7500 
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 7560 
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 7620 
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 7680 
tttaaatcaa tctaaagtat. atatgagtaa acttggtctg. acagttacca atgcttaatc 7740 
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc *ctgactcccc 7800 
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 78€0 
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 7920 
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 7980 
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgecattgct 8040 
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 8100 
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 81*60 
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 8220 
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 8280 
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 8340 
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 8400 
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 8460 
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 8520 
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 8580 
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 8640 
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 8700 
cgaaaagtgc 8710 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

cataacgcgg ccgcttcttt attcttgggc 30 

<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 



gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc 4:gctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 



<400> 5 

atgggatcca agatgaagcg cgcaagaccg 



30 




<400> 7 
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ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg -cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaecg -cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggocc gcctggeatt S40 
atgcccagta catgacctta tgggactttc ctacttggca gtacafcctac gtattagtca GOO 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 650 
actcacgggg atttccaagt ctccacccca ttgacgfecaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ccaagatgaa gcgcgcaaga ccgtctgaag ataccttcaa ccccgtgtat 960 
ccatatgaca cggaaaccgg tcctccaact gtgccttttc *tactc<*tcc ctttgtatcc 1020 
cccaatgggt ttcaagagag tccccctggg gtaotctctt tgcgcctatc -cgaacctcta 1080 
gttacctcca atggcatgct tgcgctcaaa atgggcaacg gcctctctct ggacgaggcc 1140 
ggcaacctta cctcccaaaa tgtaaccact gtgagcccac ctctcaaaaa aaccaagtca 1200 
aacataaacc tggaaatatc tgcacccctc acagttacct -cagaagccct aactgtggct 12€0 
gccgccgcac ctctaatggt cgcgggcaac acactcacca tgcaatcaca ggccccgcta 1320 
accgtgcacg actccaaact tagcattgcc acccaaggac ccctcacagt gtcagaagga 1380 
aagctagccc tgcaaacatc aggccccctc accaccaccg atagcagtac ccttactatc 1440 
actgcctcac cccctctaac tactgccact ggtagcttgg gcattgactt gaaagagccc 1500 
atttatacac aaaatggaaa actaggacta aagtacgggg ctcctttgca tgtaacagac 1560 
gacctaaaca ctttgaccgt agcaactggt ccaggtgtga ctattaataa tacttccttg 1620 
caaactaaag ttactggagc cttgggtttt gattcacaag gcaatatgca acttaatgta IS 80 
gcaggaggac taaggattga ttctcaaaac agacgcctta tacttgatgt tagttatccg 1740 
tttgatgctc aaaaccaact aaatctaaga ctaggacagg gccctctttt ^ataaactca 1800 
gcccacaact tggatattaa ctacaacaaa ggcctttact tgtttacagc ttcaaacaat 1860 
tccaaaaagc ttgaggttaa cctaagcact gccaaggggt tgatgtttga cgctacagcc 1920 
atagccatta atgcaggaga tgggcttgaa tttggttx:ac ctaatgcacc aaacacaaat 1980 
cccctcaaaa caaaaattgg ccatggccta gaatttgatt caaacaaggc tatggtfccct 2040 
aaactaggaa ctggccttag ttttgacagc acaggtgcca ttacagtagg aaacaaaaat 2100 
aatgataagc taactttgtg gaccacacca gctccatctc -ctaactgtag actaaatgca 2160 
gagaaagatg ctaaactcac tttggtctta acaaaatgtg gcagtcaaat acttgctaca 2220 
gtttcagttt tggctgttaa aggcagtttg gctccaatat ctggaacagt tcaaagtgct 2280 
catcttatta taagatttga cgaaaatgga gtgctactaa acaattectt cctggaccca 2340 
gaatattgga actttagaaa tggagatctt actgaaggca cagcctatac aaacgctgtt 2400 
ggatttatgc ctaacctatc agcttatcca aaatctcacg gtaaaactgc «caaaagtaac 24€0 
attgtcagtc aagtttactt aaacggagac aaaactaaac <:tgtaacact aaccattaca 2520 
ctaaacggta cacaggaaac aggagacaca actccaagtg catactctat gtcattttca 2580 
tgggactggt ctggccacaa ctacattaat gaaatatttg ^ccacatcctc ttacactttt 2640 
tcatacattg cccaagaata aagaagcggc cgct-cgagca tgcatctaga gggccctatt 2700 
ctatagtgtc acctaaatgc tagagctcgc tgatcagcct cgactgtgcc ttctagttgc 27S0 
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 2820 
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gt<*gagtag gtgfccattct 2880 
attctggggg gtggggtggg gcaggacagc aagggggagg at-tgggaaga -caatagcagg 2940 
catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctct 3000 
agggggtatc cccacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 3050 
cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctccttfecgc tfctctfcccct 3120 
tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg catcccttta 3180 
gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt 3240 
tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtocacg 3300 
ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat 33*60 
tcttttgatt tataagggat tttggggatt tcggcctatt ggtfcaaaaaa tgagctgatt 3420 
taacaaaaat ttaacgcgaa ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt 3480 
ccccaggctc cccaggcagg cagaagtatg caaagcatgc atcfccaatt:a gtcagcaacc 3S40 
aggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 3600 
tagtcagcaa ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3€€0 
tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggecgaggcc 3720 
gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt 3780 
tgcaaaaagc tcccgggagc ttgtatatcc attttcggat ctgafccaaga gacaggatga 3840 
ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctecggc cgcttgggtg 3900 
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gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 3960 
ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4020 
ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4080 
tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4140 
gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4200 
gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4260 
gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4320 
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4380 
cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4440 
atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4500 
cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4S60 
gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg <:atcgccttc 4620 
tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4680 
cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggt^gg 4740 
gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc 4800 
tggagttctt cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca 4860 
atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt 4920 
ccaaactcat caatgtatct tatcatgtct gtataccgtc gacctctagc tagagcttgg 4980 
cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca S040 
acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca 5100 
cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc 5160 
attaatgaat cggccaacgc gcggggagag gcggtttgcg -tattgggcgc tcttccgctt 5220 
cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 5280 
caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 5340 
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata 5400 
ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc 5460 
cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg -cgctctcctg 5520 
ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc 5580 
tttctcaatg ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 5640 
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc 5700 
ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga 5760 
ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg 5820 
gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt accttcggaa 5880 
aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 5940 
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt 60 00 
ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg gfccatgagat 6060 
tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 6120 
aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta 6180 
tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 6240 
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac 6300 
gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa 6360 
gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 6420 
taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg 6480 
tgtcacgctc gtcgtttggt atggctfccat tcagctecgg ttcecaacga tcaaggcgag 6540 
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc ^rttcggtcct ccgatcgttg 6600 
tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcaetg cataatfcctc 6660 
ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat 6720 
tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata <:gggataata 6780 
ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa ^6840 
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact -cgtgcaccca 6900 
actgatcttc agcatctttt acttfccacca gcgtttctgg gtgagcaaaa acaggaaggc 6960 
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc 7020 
tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg 7080 
aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 7140 
ctgacgtc 7148 



<210> 8 
<211> 7469 
<212> DMA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: plasntid 
<400> 8 

gacggatcgg gagatctccc gatcccctat ggtcgact-ct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg <cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat ageccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc -tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta -catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca -600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg -caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ctgaattcga gctcgctgtt gggctcgcgg ttgaggacaa actctfccgcg 960 
gtctttccag tactcttgga tcggaaaccc gtcggcctcc gaacggtact -ccgccaccga 1020 
gggacctgag cgagtccgca tcgaccggat cggaaaacct ctcgagaaag gcgtctaacc 1080 
agtcacagtc gcaaggtagg ctgagcaccg tggcgggcgg cagcgggtgg ^cggtcggggt 1140 
tgtttctggc ggaggtgctg ctgatgatgt aattaaagta ggcggtcttg agacggcgga 1200 
tggtcgaggt gaggtgtggc aggcttgaga tccaagatga agcgcgcaag accgtctgaa 1260 
gataccttca accccgtgta tccatatgac acggaaaccg gtcctccaac tgtgcctttt 1320 
cttactcctc cctttgtatc ccccaatggg tttcaagaga gtccccctgg ggtactctct 1380 
ttgcgcctat ccgaacctct agttacctcc aatggcatgc ttgcgctcaa aatgggcaac 1440 
ggcctctctc tggacgaggc cggcaacctt acctcccaaa atgtaaccac tgtgagccca 1500 
cctctcaaaa aaaccaagtc aaacataaac ctggaaatat ctgcacccct cacagttacc 1560 
tcagaagccc taactgtggc tgccgccgca cctctaatgg 4>cgcgggcaa -cacactcacc 1620 
atgcaatcac aggccccgct aaccgtgcac gactccaaac ^tagcattgc caoccaagga 1680 
cccctcacag tgtcagaagg aaagctagcc ctgcaaacat -caggccccct caccaccacc 1740 
gatagcagta cccttactat cactgcctca ccccctctaa -ctactgccac tggtagcttg 1800 
ggcattgact • tgaaagagcc catttataca caaaatggaa aactaggact aaagtacggg 1860 
gctcctttgc atgtaacaga cgacctaaac actttgaccg tagcaactgg tccaggtgtg 1920 
actattaata atacttcctt gcaaactaaa gttactggag ccttgggttt -tgattcacaa 1980 
ggcaatatgc aacttaatgt agcaggagga ctaaggattg att<:tcaaaa <:agacgcctt 2040 
atacttgatg ttagttatcc gtttgatgct caaaaccaac taaatctaag actaggacag 2100 
ggccctcttt ttataaactc agcccacaac ttggatatta actacaacaa aggcctttac 21-60 
ttgtttacag cttcaaacaa ttccaaaaag cttgaggtta acctaagcac tgccaagggg 2220 
ttgatgtttg acgctacagc catagccatt aatgcaggag atgggcttga attfcggttca 2280 
cctaatgcac caaacacaaa tcccctcaaa acaaaaattg gccatggcct agaatttgat 2340 
tcaaacaagg ctatggttcc taaactagga actggcctta gttttgacag -cacaggtgcc 2400 
attacagtag gaaacaaaaa taatgataag ctaactttgt ggaccacacc agctecatct 2460 
cctaactgta gactaaatgc agagaaagat gctaaactca ctttggtctfc aacaaaatgt .2520 
ggcagtcaaa tacttgctac agtttcagtt ttggctgtta aaggcagttt ggctccaata 2580 
tctggaacag ttcaaagtgc tcatcttatt ataagatttg acgaaaatgg agtgctacta 2640 
aacaattcct tcctggaccc agaatattgg aactttagaa atggagatct tactgaaggc 2700 
acagcctata caaacgctgt tggatttatg cctaacctat <:agcttatcc aaaafcctcac 2760 
ggtaaaactg ccaaaagtaa cattgtcagt .caagtttact taaacggaga caaaactaaa 2820 
cctgtaacac taaccattac actaaacggt acacaggaaa caggagacac aactccaagt 2880 
gcatactcta tgtcattttc atgggactgg tctggccaca actacattaa tgaaatattt 2940 
gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaagcgg ccgctcgagc 3000 
atgcatctag agggccctat tctatagtgt cacctaaatg ctagagcfccg ctgatcagcc 3060 
tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccfctg 3120 
accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat 3180 
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag x:aagggggag 3240 
gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc fctctgaggcg 3300 
gaaagaacca gctggggctc tagggggtat ccccacgcgc *cctgtagcgg -cgcattaagc 33-60 
gcggcgggtg tggtggttac gcgcagcgtg accgctacac fctgecagcgc cctagcgccc 3420 
gctcctttcg ctttcttccc ttcctttctc gecacgtt-cg ccggctttcc ccgtcaagct 3480 
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ctaaatcggg gcatcccttt agggttccga tttagtgctt tacggcacct ^gaccccaaa 3540 
aaacttgatt agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 3600 
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac *ggaacaaca 3660 
ctcaacccta tctcggtcta ttcttttgat ttataaggga ttttggggat tfccggcctat 3720 
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attaattctg tggaatgtgt 3780 
gtcagttagg gtgtggaaag tccccaggct ccccaggcag gcagaagtat gcaaagcatg 3840 
catctcaatt agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt 3900 
atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaae tccgeccato 3960 
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 4020 
atttatgcag aggccgaggc cgcctctgcc tctgagctat tccagaagta gtgaggagge 4080 
ttttttggag gcctaggctt ttgcaaaaag ctcccgggag cttgtatatc ^cattttcgga 4140 
tctgatcaag agacaggatg aggatcgttt cgcatgattg aacaagatgg attgcacgca 4200 
ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc 4260 
ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc 4320 
aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg 4330 
ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg 4440 
gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct 4S00 
gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccgget 4560 
acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa 4620 
gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa 4680 
ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccafcgge 4740 
gatgcctgct tgccgaatat catggtggaa aatggccgct tttctggatt <:atcgactgt 4800 
ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct 4860 
gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc 4920 
gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg 4980 
ggttcgaaat gaccgaccaa gcgacgccca acctgccatc acgagatttc gatfcccaccg 5040 
ccgccttcta' tgaaaggttg ggcttcggaa tcgtfcttccg ggacgccggc tggatgatcc 5100 
tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 51*60 
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 5220 
tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tgtataccgt 5280 
cgacctctag ctagagcttg gcgtaatcat ggtcatagct gtttcctgtg fcgaaattgtt 5340 
atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa gcetggggtg 5400 
cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg 54*60 
gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 5520 
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 5580 
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 5640 
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 5700 
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 5760 
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 'cccoctggaa 5820 
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgccttfcc S880 
tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt 5940 
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ^6000 
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 6060 
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 6120 
tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 6180 
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa <:aaaccaccg 6240 
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 6300 
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcaegtt 6360 
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt fetaaattaaa 6420 
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 6480 
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgect 6540 
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 6600 
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 6660 
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc -cagtctatta 67*20 
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtt-tgcgc aacgt*gttg 6780 
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca *fccagct<rcg 6840 
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 6900 
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ofccatggtta 6960 
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt -tctgtgactg 7020 
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt -tgctcttgcc 7080 
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg =cbcatcattg 7140 
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgfctgaga fcccagtfccga 7200 
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tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7250 
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7320 
gttgaatact catactcttc ctttttcaat attattgaag -catttatcag ggttattgtc 7380 
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 7440 
catttccccg aaaagtgcca cctgacgtc ~ 7469 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc 28 

<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

ccgagctagc gactgaaaat gag 23 



<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 12 
<211> 11152 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 12 

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 
aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag -cccatcagac 120 
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 
ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240 
gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 3*60 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 
a 99tggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgceccc -660 
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt -caatggctat 720 



<400> 11 

cctctcgaga gacagcaaga cac 



23 
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aaagtgttat atagatccct cccttttcgt 
tgtatgttgt ctcaagaaga aaaagacgac 
ctaggaacag gaatgcactt ttggggaaag 
ggactaatag aacattattc tgcaaaaact 
ttggcccaac cttgcggttc ccagggctta 
aacgaggatg tgagacaagt ggtttcctga 
gctctgagtg ttctattttc ctatgttctt 
ttatgtaaac caagatataa aagagtgctg 
cattcacctc ttgtgtgttt gtgtctgttc 
tcactttcca gagggtcccc ccgcagaccc 
gctggcgccc gaacagggac cctcggataa 
tgtttgtctt gtattgtctc tttcttgtct 
ccatagggac caagctagcg actgaaaatg 
accgaagaaa tggccgccag tcttttggac 
cttccacctc ctagccattt tgaaccacct 
acggcccccg aagatcccaa cgaggaggcg 
ttggcggtgc aggaagggat tgacttactc 
ccgcctcacc tttcccggca gcccgagcag 
atgccaaacc ttgtaccgga ggtgatcgat 
agtgacgacg aggatgaaga gggtgaggag 
cacggttgca ggtcttgtca ttatcaccgg 
tcgctttgct atatgaggac ctgtggcatg 
gtgggtgata gagtggtggg tttggtgtgg 
ggtttaaaga attttgtatt gtgatttttt 
agcccgagcc agaaccggag cctgcaagac 
tcctgagacg cccgacatca cctgtgtcta 
actccggtcc ttctaacaca cctcctgaga 
aaccagttgc cgtgagagtt ggtgggcgtc 
ttaacgagcc tgggcaacct ttggacttga 
acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctgc ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gcttgtgagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 



gaaagactcg ccagagctag acctccttgg 780 
atgaaacaac aggtacatga ttatatttat 840 
attttccata ccaaggaggg gacagtggct 900 
catggcatga gttattatga atagccttta 960 
agtaagtttt tggttacaaa <:tgttcttaa 1020 
cttggtttgg tatcaaaggt tctgatctga 1080 
ttggaattta tccaaatctt atgtaaatgc 1140 
attttttgag. taaacttgca acagtcctaa 1200. 
gccatcccgt ctccgctcgt cacfctatcct 12^0 
cggcgaccct caggtcggcc gactgcggca 1320 
gtgacccttg tctctatttc tactatttgg 1380 
ggctatcatc acaagagcgg aacggactca 1440 
agacatatta tctgccacgg aggtgttatt 1500 
cagctgatcg aagaggtact ggctgataat 1S60 
acccttcacg aactgtatga tttagacgtg 1620 
gtttcgcaga tttttcccga ctctgtaatg 1680 
acttttccgc cggcgcccgg ttctccggag 1740 
ccggagcaga gagccttggg tccggtttct 1800 
cttacctgcc acgaggctgg -ctttccaccc 1860 
tttgtgttag attatgtgga gcaccccggg 1920 
aggaatacgg gggacccaga tattatgtgt 1980 
tttgtctaca gtaagtgaaa attatgggca 2040 
taattttttt tttaattttt acagttttgt 2100 
taaaaggtcc tgtgtctgaa cctgagcctg 2160 
ctacccgccg tcctaaaatg gcgcctgcta 2220 
gagaatgcaa tagtagtacg gatagctgtg 2280 
tacacccggt ggtcccgctg tgccccatta 2340 
gccaggctgt ggaatgtatc gaggacttgc 2400 
gctgtaaacg ccccaggcca taaggtgtaa 24*0 
tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 2640 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 2760 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2940 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggegata ataccgacgg 3060 
ggcggcggcg gcaggagcag agoccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 3240 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagctgacc acttactggc *tgcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 3480 
gaattgttgc tacatttctg ggaacggggc 3S40 
ggcctttaga tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa ZttO 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 3780 
ccccaaaagc agggcttcaa fctaagaaatg 3840 
gtctgagggt aactccaggg tgcgccacaa 3900 
agtgaaaagc gtggctgtga ttaagcataa 3960 
ctctcagatg ctgacctgct .cggacggcaa 4020 
cagccactct cgcaaggcct ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 4140 
gatattgctt gagcccgaga gcatgfcccaa 4200 
catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatoac ttggtgctgg <x;tgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 4440 
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cttaagggtg ggaaagaata tataaggtgg 
agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt taacttgttt 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatc ttatcatgtc 
gggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccaggtgtgg aaagtcccca 
atgcatctca attagtcagc aaccatagtc 
actccgccca gttccgccca ttctccgccc 
gaggccgagg ccgcctcggc ctctgagcta 
ggcctaggct tttgcaaaaa gcttggacac 
accactttat cccgcgtcag ggagaggcag 
actggttttt agtgcgccag atctctataa 
tttttaagcc gtagataaac aggctgggac 
ctgggacatg ttgcagatcc atgcacgtaa 
atggaaaggc attattgccg taagccgtgg 
tgaactgggt attcgtcatg tcgataccgt 
gcgcgagctt aaagtgctga aacgcgcaga 
tgacctggtg gataccggtg gtactgcggt 
ctttgtcacc atcttcgcaa aaccggctgg 
tatcccgcaa gatacctgga ttgaacagcc 
aatctccggt cgctaatctt ttcaacgcct 
caggcgggtt acaatagttt ccagtaagta 
acctgagcga aaccctgttc aaaccccgct 
gccgctttaa tcacggcgca caaccgcctg 
ctcactggta tcgcatgatt aaccgtctga 
aaatcctcga cgtccaggca cgtattgtga 
tatacgatac ggtgattggc taccgtggcg 
tttgtgaagg aaccttactt ctgtggtgtg 
taaagctcta aggtaaatat aaaattttta 
attgtttgtg tattttagat tccaacctat 
gcctttaatg aggaaaacct gttttgcfcca 
actgctgact ctcaacattc tactcctcca 
gactttcctt cagaattgct aagttttttg 
gcttgctttg ctatttacac cacaaaggaa 
gaaaaatatt ctgtaacctt tataagtagg 
tttcttactc cacacaggca tagagtgtct 
acctttagct ttttaatttg taaaggggtt 
actagagatc ataatcagcc ataccacatt 
cccacacctc cccctgaacc tgaaacataa 
tattgcagct tataatggtt acaaataaag 
atttttttca ctgcattcta gttgtggttt 
ctggatcccc aggaagctcc tctgtgtcct 



gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 4560 
gcgtcagaat gtgatgggct ccagcattga 4620 
taccttgacc tacgagaccg tgtctggaac 4680 
ttcagccgct gcagccaccg cccgcgggat 4740 
tgcaagcagt gcagcttccc gtt<:atccgc 4800 
acaattggat tctttgaccc gggaacttaa 4860 
ccagcaggtt tctgccctga aggctt-cctc 4S20 
aaaaccagac tctgtttgga tttggatcaa 4980 
ttgtgaagga accttacttc tgtggtgtga 5040 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ocaacctatg S160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct acfccctccaa 5280 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 5400 
aaaaatattc tgtaaccttt ataagtaggc 5*60 
ttcttactcc acacaggcat agagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 5580 
ctagagatca taatcagcca taccacattt 5640 
ccacacctcc ccctgaacct .gaaacataaa 5700 
attgcagctt ataatggtta -caaataaagc 5760 
tttttttcac tgcattctag ttgtggtttg 5820 
tggatccggc tgtggaatgt gtgtcagtta 5880 
ggcagaagta tgcaaagcat gcatctcaat 5940 
ggctccccag caggcagaag tatgeaaagc 6000 
ccgcccctaa ctccgcccat <2ccgccccta 6060 
-catggetgac taattttttt tatttatgca 6120 
ttccagaagt agtgaggagg cttttttgga 6180 
aagacaggct tgcgagatat gtttgagaat 6240 
tgcgtaaaaa gacgcggact catgtgaaat 6300 
tctcgcgcaa cctatfrttcc cctcgaacac 6360 
acttcacatg agcgaaaaat acatcgtcac ~6420 
aotcgcaagc cgactgatgc -cttctgaaca 6480 
cggtctggta ccgggtgcgt tactggcgcg 6540 
ttgtatttcc agctacgatc acgacaacca 6^600 
aggcgatggc gaaggcttca tcgttattga 6660 
tgcgattcgt gaaatgtatc <:aaaagcgca 6720 
tcgtccgctg gttgatgact atgttgttga 6780 
gtgggatatg ggcgtcgtat tcgtcccgcc 6840 
ggcactgccg ggcgttgttc tttttaactt 6900 
ttctggaggc tgcatocatg acacaggcaa 6960 
ttaaacatcc tgaaacctcg acgctagtcc 7020 
tgcagtcggc ccttgatggt aaaaccatcc 7080 
tgtggatctg gcgcggcatt gacccacgcg 7140 
tgagcgatgc -cgaacgtacc gacgatgatt 7200 
gcaactggat ttatgagtgg gccccggatc 7260 
acataattgg acaaactacc tacagagatt 7320 
agtgtataat gtgttaaact actgattcta 7380 
ggaactgatg aatgggagca gtggtggaat 7440 
gaagaaatgc catctagtga ^tgatgaggct 7500 
aaaaagaaga gaaaggtaga agaccccaag 7560 
agtcatgctg tgtttagtaa -tagaactctt 7620 
aaagctgcac tgctatacaa gaaaattatg 7680 
cataacagtt ataatcataa catactgttt 7740 
gctattaata actatgcfcca aaaattgtgt 7800 
aataaggaat atttgatgta tagtgccttg 7860 
tgtagaggtt ttacttgctt -taaaaaacct 7920 
aatgaatgca attgttgttg ttaacttgtt 7980 
caatagcatc acaaatttca caaataaagc 8040 
gtccaaactc atcaatgtat ctfcatcatgt 8100 
cataaaccct aacctcctct acttgagagg 8160 
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acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 8280 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca £520 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8S80 
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg. tagtttgcta 8640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8750 
tagcagttac- cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8820 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacatfccaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagtttfccg 9240 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt cfccagaatga 9360 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9S40 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9*600 
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 9660 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagecg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 9900 
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatet 10020 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 10080 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 10140 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagecgta 10260 
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgcfcc *tgctaatoct 10320 
Sttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 10380 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 10440 
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10S00 
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg fccggaacagg 10S60 
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620 
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680 
gaaaaacgcc agcaacgcgg cctttttacg gtfccctggcc ttttgctggc cttttgctca 10740 
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 10800 
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 10860 
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 10920 
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg 10980 
ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc ^gctgacgcg 11040 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac -cgtctecggg 111O0 
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca gc " 11152 

<210> 13 
<2U> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccgcctcaga agccatagag cc 22 



<210> 15 
<211> 14455 
<212> DNA 
• <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 15 

aagcttgggc agaaatggtt gaactcccga gagtgtecta cacctagggg agaagcagcc 60 
aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag >cccatcagac 120 
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 
ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actctt-ttaa tagctcttct 240 
gtgcaagatt acaatctaaa caattcggag aactcgacct tccfccctgag gcaaggacca 300 
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ct-caaattca gaagttagaa 480 
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac €00 
aggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720 
aaagtgttat atagatccct cccttttcgt gaaagactcg -ccagagctag acctccttgg 780 
tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat 840 
ctaggaacag gaatgcactt ttggggaaag attttccata ccaaggaggg gacagtggct 900 
ggactaatag aacattattc tgcaaaaact catggcatga gttattatga atagcct^tta 960 
ttggcccaac cttgcggttc ccagggctta agtaagtttt tggttacaaa ctgtfccttaa 1020 
aacgaggatg tgagacaagt ggtttcctga cttggtttgg tatcaaaggt tctgatctga 1080 
gctctgagtg ttctattttc ctatgttctt ttggaattta tccaaatctt atgtaaatgc 1140 
ttatgtaaac caagatataa aagagtgctg attttttgag taaacttgca acagtcctaa 1200 
cattcacctc ttgtgtgttt gtgtctgttc gccatcccgt ctx:cgctcgt cactfcatcct 1260 
tcactttcca gagggtcccc ccgcagaccc cggcgaccct caggtcggcc gactgcggca 1320 
gctggcgccc gaacagggac cctcggataa gtgacccttg t<rt<:tat^tc tactatttgg 1380 
tgtttgtctt gtattgtctc tttcttgtct ggctatcatc acaagagcgg aacggactca 1440 
ccatagggac caagctagcg actgaaaatg agacatatta tctgccacgg aggtgttatt 1S00 
accgaagaaa tggccgccag tcttttggac cagctgatcg aagaggtact ggctgataat 1S60 
cttccacctc ctagccattt tgaaccacct acccttcacg aactgtatga tttagacgtg 1^20 
acggcccccg aagatcccaa cgaggaggcg gtttcgcaga ttttfccccga -ctctgtaatg 1680 
ttggcggtgc aggaagggat tgacttactc acttttccgc cggcgcccgg ttctccggag 1740 
ccgcctcacc tttcccggca gcccgagcag ccggagcaga gagccttggg tccggtttct 1800 
atgccaaacc ttgtaccgga ggtgatcgat cttacctgcc acgaggctgg ctttccaccc 1860 
agtgacgacg aggatgaaga gggtgaggag tttgtgfctag attatgtgga gcaccccggg 1920 
cacggttgca ggtcttgtca ttatcaccgg aggaatacgg gggacccaga tattatgtgt 1980 
tcgctttgct atatgaggac ctgtggcatg tttgtctaca gtaagtgaaa attatgggca 2040 
gtgggtgata gagtggtggg tttggtgtgg taattttttt tttaattttt acagttfctgt 2100 
ggtttaaaga attttgtatt gtgatttttt taaaaggtcc tgtgfcctgaa cctgagcctg 2160 
agcccgagcc agaaccggag cctgcaagac ctacccgccg tcctaaaatg gcgcctgcta 2220 
tcctgagacg cccgacatca cctgtgtcta gagaatgcaa tagtagtacg gatagct-gtg 2280 
actccggtcc ttctaacaca cctcctgaga tacacccggt ggtcrccgctg tgecccatta 2340 
aaccagttgc cgtgagagtt ggtgggcgtc gccaggctgt ggaatgtatc ^aggacttgc 2400 
ttaacgagcc tgggcaacct ttggacttga gctgtaaacg «ccccaggcca -taaggtgtaa 2460 
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acctgtgatt gcgtgtgtgg ttaacgcctt tgtttgctga atgagttgat gtaagtttaa 2520 
taaagggtga gataatgttt aacttgcatg gcgtgttaaa tggggcgggg cttaaagggt 2S80 
atataatgcg ccgtgggcta atcttggtta catctgacct catggaggct tgggagtgtt 2640 
tggaagattt ttctgctgtg cgtaacttgc tggaacagag ctctaacagt acctcttggt 2700 
tttggaggtt tctgtggggc tcatcccagg caaagttagt ctgcagaatt aaggaggatt 2760 
acaagtggga atttgaagag cttttgaaat cctgtggtga gctgtttgat tctttgaatc 2820 
tgggtcacca ggcgcttttc caagagaagg tcatcaagac tttggatttt tccacaccgg 2880 
ggcgcgctgc ggctgctgtt gcttttttga gttttataaa ggataaatgg agcgaagaaa 2940 
cccatctgag cggggggtac ctgctggatt ttctggccat gcatctgtgg agagcggttg 3O00 
tgagacacaa gaatcgcctg ctactgttgt cttccgtccg cccggcgata ataccgacgg 3060 
aggagcagca gcagcagcag gaggaagcca ggcggcggcg gcaggagcag agcccatgga 3120 
acccgagagc cggcctggac cctcgggaat gaatgttgta caggtggctg aactgtatcc 3180 
agaactgaga cgcattttga caattacaga ggatgggcag gggctaaagg gggtaaagag 3240 
ggagcggggg gcttgtgagg ctacagagga ggctaggaat ctagctttta gcttaatgac 3300 
cagacaccgt cctgagtgta ttacttttca acagatcaag gataa-ttgcg ctaatgagct 3360 
tgatctgctg gcgcagaagt attccataga gcagctgacc acttactggc tgcagccagg 3420 
ggatgatttt gaggaggcta ttagggtata tgcaaaggtg gcacttaggc cagattgcaa 3480 
gtacaagatc agcaaacttg taaatatcag gaattgttgc tacatttctg ggaacggggc 3540 
cgaggtggag atagatacgg aggatagggt ggcctttaga tgtagcatga taaatatgtg 3600 
gccgggggtg cttggcatgg acggggtggt tattatgaat gtaaggttta ctggccccaa 3660 
ttttagcggt acggttttcc tggccaatac caaccttatc ctacacggtg taagcttcta 3720 
tgggtttaac aatacctgtg tggaagcctg gaccgatgta agggttcggg gctgtgcctt 3780 
ttactgctgc tggaaggggg tggtgtgtcg ccccaaaagc agggcttcaa ttaagaaatg 3840 
cctctttgaa aggtgtacct tgggtatcct gtctgagggt aact-ccaggg tgcgccacaa 39O0 
tgtggcctcc gactgtggtt gcttcatgct agtgaaaagc gtggctgtga t-taagcataa 3960 
catggtatgt ggcaactgcg aggacagggc ctctcagatg ctgacctgct cggacggcaa 4020 
ctgtcacctg ctgaagacca ttcacgtagc cagccactct cgcaaggcct ggccagtgtt 4080 
tgagcataac atactgaccc gctgttcctt gcatttgggt aacaggaggg gggtgttcct 4140 
accttaccaa tgcaatttga gtcacactaa gatattgctt gagcccgaga gcatgtecaa 4200 
ggtgaacctg aacggggtgt ttgacatgac catgaagatc tggaaggtgc tgaggtacga 4260 
tgagacccgc accaggtgca gaccctgcga gtgtggcggt aaacatatta ggaaccagcc 4320 
tgtgatgctg gatgtgaccg aggagctgag gcccgatcac ttggtgctgg cctgcacccg 4380 
cgctgagttt ggctctagcg atgaagatac agattgaggt actgaaatgt gtgggcgtgg 4440 
cttaagggtg ggaaagaata tataaggtgg gggtcttatg tagttttgta tctgttttgc 4S00 
agcagccgcc gccgccatga gcaccaactc gtttgatgga agcattgtga gctcatattt 4S60 
gacaacgcgc atgcccccat gggccggggt gcgtcagaat gtgatgggct <ccagcattga 4620 
tggtcgcccc gtcctgcccg caaactctac taccttgacc tacgagaccg tgtctggaac 4680 
gccgttggag actgcagcct ccgccgccgc ttcagccgct gcagccaccg cccgcgggat 4740 
tgtgactgac tttgctttcc tgagcccgct tgcaagcagt gcagcttccc gtfccatccgc 4800 
ccgcgatgac aagttgacgg ctcttttggc acaattggat tctttgaccc gggaaottaa 4860 
tgtcgtttct cagcagctgt tggatctgcg ccagcaggtt -tctgccctga aggcttcctc 4920 
ccctcccaat gcggtttaaa acataaataa aaaaccagac tctgtttgga tttggatcaa 4980 
gcaagtgtct tgctgtctct cgagggatct ttgtgaagga accttacttc tgtggtgtga 5040 
cataattgga caaactacct acagagattt aaagctctaa ggtaaatata aaatttttaa 5100 
gtgtataatg tgttaaacta ctgattctaa ttgtttgtgt attttagatt ccaacctatg S160 
gaactgatga atgggagcag tggtggaatg cctttaatga ggaaaacctg ttttgctcag 5220 
aagaaatgcc atctagtgat gatgaggcta ctgctgactc tcaacattct acfcccfcecaa 5280 
aaaagaagag aaaggtagaa gaccccaagg actttccttc agaattgcta agttttttga 5340 
gtcatgctgt gtttagtaat agaactcttg cttgctttgc tatttacacc acaaaggaaa 5400 
aagctgcact gctatacaag aaaattatgg aaaaatatfcc tgtaaccttt ataagtaggc 5460 
ataacagtta taatcataac atactgtttt ttcttactcc acacaggcat agagtgtctg SS20 
ctattaataa ctatgctcaa aaattgtgta cctttagctt tttaatttgt aaaggggtta 5S80 
ataaggaata tttgatgtat agtgccttga ctagagatca taatcagcca 4;accacattt 5640 
gtagaggttt tacttgcttt aaaaaacctc ccacacctcc *ccctgaacct gaaacataaa 5700 
atgaatgcaa ttgttgttgt taacttgttt attgcagctt ataatggtta -caaataaagc 5760 
aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg 5820 
tccaaactca tcaatgtatc ttatcatgtc tggatccggc tgtggaatgt gtgtcagt-ta 5880 
gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatcfccaat 5940 
tagtcagcaa ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 6000 
atgcatctca attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta «060 
actccgccca gttccgccca ttctccgccc catggctgac taattttttt: tattfcatgca "6120 
gaggccgagg ccgcctcggc -ctctgagcta ttccagaagt agtgaggagg cttttttgga -6180 
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ggcctaggct tttgcaaaaa gcttggacac aagacaggct tgcgagatat gtttgagaat 6240 
accactttat cccgcgtcag ggagaggcag tgcgtaaaaa gacgcggact catgtgaaat 6300 
actggttttt agtgcgccag atctctataa tctcgcgcaa cctattttcc cctcgaacac 6360 
tttttaagcc gtagataaac aggctgggac acttcacatg agcgaaaaat acatcgtcac "6420 
ctgggacatg ttgcagatcc atgcacgtaa actcgcaagc cgactgatgc cttctgaaca 6480 
atggaaaggc attattgccg taagccgtgg cggtctggta ccgggtgcgt tactggcgcg 6540 
tgaactgggt attcgtcatg tcgataccgt ttgtatttcc agctacgatc acgacaacca 6€00 
gcgcgagctt aaagtgctga aacgcgcaga aggcgatggc gaaggcttca tcgttattga €660 
tgacctggtg gataccggtg gtactgcggt tgcgattcgt gaaatgtatc caaaagcgca €720 
ctttgtcacc atcttcgcaa aaccggctgg tcgtccgctg gttgatgact atgttgttga €780 
tatcccgcaa gatacctgga ttgaacagcc gtgggatatg ggcgfccgtat £cgtcccgcc 6840 
aatctccggt cgctaatctt ttcaacgcct ggcactgccg ggcgfctgttc tttttaactt €900 
caggcgggtt acaatagttt ccagtaagta ttctggaggc tgcatccatg acacaggcaa 6960 
acctgagcga aaccctgttc aaaccccgct ttaaacatcc tgaaacctcg acgctagtcc 7020 
gccgctttaa tcacggcgca caaccgcctg tgcagtcggc ccttgatggt aaaaccatcc 7080 
ctcactggta tcgcatgatt aaccgtctga tgtggatctg gcgcggcatt gacccacgcg 7140 
aaatcctcga cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc gacgatgatt 7200 
tatacgatac ggtgattggc taccgtggcg gcaactggat ttatgagtgg gccccggatc 72€0 
tttgtgaagg aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt 7320 
taaagctcta aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta 7380 
attgtttgtg tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat 7440 
gcctttaatg aggaaaacct gttttgctca gaagaaatgc catctagtga tgatgaggct 7500 
actgctgact ctcaacattc tactcctcca aaaaagaaga gaaaggtaga agaccccaag 7560 
gactttcctt cagaattgct aagttttttg agtcatgctg tgtttagtaa tagaactctt 7620 
gcttgctttg ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg 7680 
gaaaaatatt ctgtaacctt tataagtagg cataacagtt ataatcataa <:at:actgttt 7740 
tttcttactc cacacaggca tagagtgtct gctattaata actatgctca aaaattgtgt 7800 
acctttagct ttttaatttg taaaggggtt aataaggaat atttgatgta -tagtgccttg 7860 
actagagatc ataatcagcc ataccacatt tgtagaggtt ttacttgctt taaaaaacct 7920 
cccacacctc cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt 7980 
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040 
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 8100 
ctggatcccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 81€0 
acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 8280 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg -cccaacaccc tgcfccatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc -cagcactcca 8520 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8S80 
caactgtagc attttttggg gttacagttt gagcaggata tttggfccctg tagtttgcta 8640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaocctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagttfcaaca 87€0 
tagcagttac cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatattfcc 8820 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg catfcttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 93€0 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9540 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600 
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 96€0 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag cccfccccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagafccg ctgagatagg 9900 
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tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 10020 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 10080 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 10140 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag ^gtagccgta 10260 
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 10320 
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg. 10380 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca -cacagcccag 10440 
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10S00 
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 10560 
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620 
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680 
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc -cttttgctca 10740 
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg -cctttgagtg 10800 
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagfccagtga gcgaggaagc 10860 
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt <:acaccgcat 10920 
accgcctcag aagccataga gcccaccgca tccccagcat gcctgctatt gtcttcccaa 10980 
tcctccccct tgctgtcctg ccccacccca ccccccagaa tagaatgaca cctactcaga 11040 
caatgcgatg caatttcctc attttattag gaaaggacag tgggagtggc accttccagg 11100 
gtcaaggaag gcacggggga ggggcaaaca acagatggct ggcaactaga aggcacagfcc 11160 
gaggctgatc agcgagctct agcatttagg tgacactata gaatagggcc <:tctagatgc 11220 
atgctcgagc ggccgcttct ttattcttgg gcaatgtatg aaaaagtgta agaggatgtg 11280 
gcaaatattt cattaatgta gttgtggcca gaccagtccc atgaaaatga catagagtat 11340 
gcacttggag ttgtgtctcc tgtttcctgt gtaccgttta gtgtaatggt tagtgfctaca 11400 
ggtttagttt tgtctccgtt taagtaaact tgactgacaa tgttactttt ggcagtttta 11460 
ccgtgagatt ttggataagc tgataggtta ggcataaatc caacagcgtt tgtataggct 11520 
gtgccttcag taagatctcc atttctaaag ttccaatatt ctgggtccag gaaggaattg 11580 
tttagtagca ctccattttc gtcaaatctt ataataagat gagcacfcttg aactgttcca 11640 
gatattggag ccaaactgcc tttaacagcc aaaactgaaa ctgtagcaag tatt-tgactg 11700 
ccacattttg ttaagaccaa agtgagttta gcatctttct ctgcatttag tctacagtta 11760 
ggagatggag ctggtgtggt ccacaaagtt agcttatcat tatttttgtt tcctactgta 11820 
atggcacctg tgctgtcaaa actaaggcca gttcctagtt taggaaccat agcctfcgttt 11880 
gaatcaaatt ctaggccatg gccaattttt gttttgaggg gatttgtgtt tggtgcatta 11940 
ggtgaaccaa attcaagccc atctcctgca ttaatggcta tggctgtagc gtcaaacatc 12000 
aaccccttgg cagtgcttag gttaacctca agctttttgg aattgtttga agctgtaaac 12060 
aagtaaaggc ctttgttgta gttaatatcc aagttgtggg ctgagtttat aaaaagaggg 12120 
ccctgtccta gtcttagatt tagttggttt tgagcatcaa acggataact aacatcaagt 12180 
ataaggcgtc tgttttgaga atcaatcctt agtcctcctg ctacattaag ttgcatattg 12240 
ccttgtgaat caaaacccaa ggctccagta actttagttt gcaaggaagt attattaata 12300 
gtcacacctg gaccagttgc tacggtcaaa gtgtttaggt cgtctgttac atgcaaagga 12360 
gccccgtact ttagtcctag ttttccattt tgtgtataaa tgggctcttt caagtcaatg 12420 
cccaagctac cagtggcagt agttagaggg ggtgaggcag ^gatagtaag ggtactgcta 12480 
tcggtggtgg tgagggggcc tgatgtttgc agggctagct ttccttctga cactgtgagg 12540 
ggtccttggg tggcaatgct aagtttggag tcgtgcacgg ttagcggggc ctgtgattgc 12600 
atggtgagtg tgttgcccgc gaccattaga ggtgcggcgg <:agccacagt tagggcttct 12660 
gaggtaactg tgaggggtgc agatatttcc aggtttatgt ttgacttggt ttttttgaga 12720 
ggtgggctca cagtggttac attttgggag gtaaggttgc cggcctcgtc cagagagagg 12780 
ccgttgccca ttttgagcgc aagcatgcca ttggaggtaa ctagaggttc ggataggcgc 12840 
aaagagagta ccccaggggg actctcttga aacccattgg gggatacaaa gggaggagta 12900 
agaaaaggca cagttggagg accggtttcc gtgtcatatg gatacacggg gttgaaggta 12960 
tcttcagacg gtcttgcgcg cttcatcttg gatctcaagc ctgccacacc tcacctcgac 13020 
catccgccgt ctcaagaccg cctactttaa ttacatcatc agcagcacct ecgccagaaa 13080 
caaccccgac cgccacccgc tgccgcccgc cacggtgctc agcctacctt gegactgtga 13140 
ctggttagac gcctttctcg agaggttttc cgatccggtc gatgcggact xrgctcaggtc 13200 
cctcggtggc ggagtaccgt tcggaggccg acgggtttcc gatccaagag tactggaaag 13260 
accgcgaaga gtttgtcctc aaccgcgagc ccaacagcga gctegaattc agatccgagc 13320 
tcggtaccaa gcttgggtct ccctatagtg agtcgtatta atttcgataa gccagtaagc 13380 
agtgggttct ctagttagcc agagagctct gcttatatag acctcccacc gtacacgcct 13440 
accgcccatt tgcgtcaatg gggcggagtt gttacgacat tttggaaagt cccgttgatt 13500 
ttggtgccaa aacaaactcc cattgacgtc aatggggtgg agacttggaa atccecgtga 135€0 
gtcaaaccgc tatccacgcc cattgatgta ctgccaaaac cgcatcacca tggtaatagc 13620 
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gatgactaat acgtagatgt actgccaagt aggaaagtcc cataaggfcca tgtactgggc 13680 
ataatgccag gcgggccatt taccgtcatt gacgtcaata gggggcgtac ttggcatatg 13740 
atacacttga tgtactgcca agtgggcagt ttaccgtaaa tagtccaccc attgacgtca 13800 
atggaaagtc cctattggcg ttactatggg aacatacgtc attattgacg tcaatgggcg 13660 
ggggtcgttg ggcggtcagc caggcgggcc atttaccgta agttatgtaa cgcggaactc 13920 
catatatggg ctatgaacta atgaccccgt aattgattac tattaataac tagtcaataa 13980 
tcaatgtcaa cgcgtatatc tggcccgtac atcgcgaagc agcgcaaaac gcctaaccct 14040 
aagcagattc ttcatgcaat tgtcggtcaa gccttgrott. gttgtagctt aaatt-ttgct 14100 
cgcgcactac tcagcgacct ccaacacaca agcagggagc agatactggc ttaactatgc 14160 
ggcatcagag cagattgtac tgagagtcga ccatagggga tcgggagatc tcccgatccg 14220 
tctatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact 14280 
ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac 14340 
gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc 14400 
gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag gcagc 144SS 



<210> 16 
<211> 10610 - 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 16 

gacggatcgg gagatccgcg cggtacacag aattcaggag acacaactcc aagtgcatac 60 
tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat atttgccaca 120 
tcctcttaca ctttttcata cattgcccaa gaataaagaa tcgtttgtgt tatgtttcaa 180 
cgtgtttatt tttcaattgc agaaaatttc aagtcatttt tcattcagta gtatagcccc 240 
accaccacat agcttataca gatcaccgta ccttaatcaa actcacagaa ccctagtatt 300 
caacctgcca cctccctccc aacacacaga gtacacagtc ctttctcccc ggctggcctt 360 
aaaaagcatc atatcatggg taacagacat attcttaggt gttatattcc acacggttto 420 
ctgtcgagcc aaacgctcat cagtgatatt aataaactcc ocgggcagct cacttaagtt 480 
catgtcgctg tccagctgct gagccacagg ctgctgtcca acttgcggtt gcttaacggg 540 
cggcgaagga gaagtccacg cctacatggg ggtagagtca taatcgtgca tcaggatagg 600 
gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc cgccgctccg tcctgcagga 660 
atacaacatg gcagtggtct cctcagcgat gattcgcacc gcccgcagca taaggcgcct 720 
tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa tcagcacagt aactgcagca 780 
cagcaccaca atattgttca aaatcccaca gtgcaaggcg ctgtafcccaa agctcatggc 840 
ggggaccaca gaacccacgt ggccatcata ccacaagcgc aggtagatta agtggcgacc 900 
cctcataaac acgctggaca taaacattac ctcttttggc atgttgtaat tcaccacctc 960 
ccggtaccat ataaacctct gattaaacat ggcgccatcc accaccatcc taaaccagct 1020 
ggccaaaacc tgcccgccgg ctatacactg cagggaaccg ggactggaac aatgacagtg 1080 
gagagcccag gactcgtaac catggatcat catgctcgtc atgatafccaa *gttggcaca 1140 
acacaggcac acgtgcatac acttcctcag gattacaagc tcctcccgcg ttagaaccat 1200 
atcccaggga acaacccatt cctgaatcag cgtaaatccc acactgcagg gaagacctcg 1260 
cacgtaactc acgttgtgca ttgtcaaagt gttacattcg ggcagcagcg gatgatcctc 1320 
cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga xrgatccctac tgtacggagt 1380 
gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg ccaaatggaa cgccggacgt 1440 
agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca aacagatctg cgtctccggt 1500 
ctcgccgctt agatcgctct gtgtagtagt tgtagtatat ocactctctc aaagcatcca 1560 
ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg cgccgctgcc ctgataacat 1620 
ccaccaccgc agaataagcc acacccagcc aacctacaca ttcgttctgc gagtcacaca 1680 
cgggaggagc gggaagagct ggaagaacca tgtttttttt tttattccaa aagattatcc 1740 
aaaacctcaa aatgaagatc tattaagtga acgcgctccc ctccggtggc gtggtcaaac 1800 
tctacagcca aagaacagat aatggcattt gtaagatgtt gcacaatggc ttccaaaagg 1860 
caaacggccc tcacgtccaa gtggacgtaa aggctaaacc ctfccagggtg aatctcctct 1920 
ataaacattc cagcaccttc aaccatgccc aaataattct catctcgcca <ccttctcaat 1980 
atatctctaa gcaaatcccg aatattaagt ccggccattg taaaaatctg cfcccagagcg 2040 
ccctccacct tcagcctcaa gcagcgaatc atgattgcaa aaattcaggt tcctcacaga 2100 
cctgtataag attcaaaagc ggaacattaa caaaaatacc gcgafccccgt aggtcccttc 2160 
gcagggccag ctgaacataa tcgtgcaggt ctgcacggac cagcgcggcc acttccccgc -2220 
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caggaacctt gacaaaagaa cccacactga 
ccagcgtagc cccgatgtaa gctttgttgc 
ctcaaaaaat caggcaaagc ctcgcgcaaa 
agataaaggc aggtaagctc cggaaccacc 
atgtctgcgg gtttctgcat aaacacaaaa 
aagcctgtct tacaacagga aaaacaaccc 
cggcgtgacc gtaaaaaaac tggtcaccgt 
tcatgtccgg agtcataatg taagactcgg 
gctaaaaagc gaccgaaata gcccggggga 
acagccccca taggaggtat aacaaaatta 
aaaccctcct gcctaggcaa aatagcaccc 
cagcggcagc ctaacagtca gccttaccag 
ctcgacacgg caccagctca atcagtcaca 
atatatagga ctaaaaaatg acgtaacggt 
cacgcgaacc tacgcccaga aacgaaagcc 
ttccgttttc ccacgttacg taacttcccg 
tcagtacaat ctgctctgat gccgcatagt 
tggaggtcgc tgagtagtgc gcgagcaaaa 
acaattgcat gaagaatctg cttagggtta 
ccagatatac gcgttgacat tgattattga 
cattagttca tagcccatat atggagttcc 
ctggctgacc gcccaacgac ccccgcccat 
taacgccaat agggactttc cattgacgtc 
acttggcagt acatcaagtg tatcatatgc 
gtaaatggcc cgcctggcat tatgcccagt 
agtacatcta cgtattagtc atcgctatta 
atgggcgtgg atagcggttt gactcacggg 
atgggagttt gttttggcac caaaatcaac 
ccccattgac gcaaatgggc ggtaggcgtg 
tctggctaac tagagaaccc actgcttact 
gggagaccca agcttggtac cgagctcgga 
gttgaggaca aactcttcgc ggtctttcca 
cgaacggtac tccgccaccg agggacctga 
tctcgagaaa ggcgtctaac cagtcacagt 
gcagcgggtg gcggtcgggg ttgtttctgg 
aggcggtctt gagacggcgg atggtcgagg 
aagcgcgcaa gaccgtctga agataccttc 
ggtcctccaa ctgtgccttt tcttactcct 
agtccccctg gggtactctc tttgcgccta 
cttgcgctca aaatgggcaa cggcctctct 
aatgtaacca ctgtgagccc acctctcaaa 
tctgcacccc tcacagttac ctcagaagcc 
gtcgcgggca acacactcac catgcaatca 
cttagcattg ccacccaagg acccctcaca 
tcaggccccc tcaccaccac cgatagcagt 
actactgcca ctggtagctt gggcattgac 
aaactaggac taaagtacgg ggctcctttg 
gtagcaactg gtccaggtgt gactattaat 
gccttgggtt ttgattcaca aggcaatatg 
gattctcaaa acagacgcct tatacttgat 
ctaaatctaa gactaggaca gggccctctt 
aactacaaca aaggccttta cttgtttaca 
aacctaagca ctgccaaggg gttgatgttt 
gatgggcttg aatttggttc acctaatgca 
ggccatggcc tagaatttga ttcaaacaag 
agttttgaca gcacaggtgc cattacagta 
tggaccacac cagctccatc tcctaactgt 
actttggtct taacaaaatg tggcagtcaa 
aaaggcagtt tggctccaat atctggaaca 
gacgaaaatg gagtgctact aaacaattcc 
aatggagatc ttactgaagg cacagcctat 
tcagcttatc caaaatctca cggtaaaact 



ttatgacacg catactcgga gctatgctaa 2280 
atgggcggcg atataaaatg caaggtgctg 2340 
aaagaaagca catcgtagtc atgctcatgc 2400 
acagaaaaag acaccatttt tctctcaaac 24^60 
taaaataaca aaaaaacatt taaacattag 2520 
ttataagcat aagacggact acggccatgc 2580 
gattaaaaag caccaccgac agctcctcgg 2640 
taaacacatc aggttgattc atcggccagt 2700 
atacataccc gcaggcgtag agacaacatt 27*60 
ataggagaga aaaacacata aacacctgaa 2820 
tcccgctcca gaacaacata -cagcgcttca 2880 
taaaaaagaa aacctattaa aaaaacacca 2940 
gtgtaaaaaa gggccaagtg cagagcgagt 3000 
taaagtccac aaaaaacacc cagaaaaccg 306 0 
aaaaaaccca caacttcctc aaatcgtcac 3120 
gatcctctcc cgatccccta tggccgactc 31*80 
taagccagta tctgctccct gcttgtgtgt 3240 
tttaagctac aacaaggcaa ggcttgaccg 3300 
ggcgttttgc gctgcttcgc gatgcacggg 3360 
ctagttatta atagtaatca attacggggt 3420 
gcgttacata acttacggta aatggcccgc 3480 
tgacgtcaat aatgacgtat gttcocatag 354 0 
aatgggtgga ctatttacgg taaactgccc 3600 
caagtacgcc ccctattgac gtcaatgacg 3660 
acatgacctt atgggacttt cctacttggc 3720 
ccatggtgat gcggttttgg cagtacatca 3780 
gatttccaag tctgcacccc attgacgtca 3840 
gggactttcc aaaatgtcgt aacaactccg 3900 
tacggtggga ggtctatata agcagagctc 3960 
ggctcatcga aattaatacg actcactata 4020 
tctgaattcg agctcgctgt tgggctcgcg 4080 
gtactcttgg atcggaaacc cgtcggcctc 4140 
gcgagtccgc atcgaccgga tcggaaaacc 4200 
cgcaaggcag gctgagcacc gtggcgggcg 4260 
cggaggtgct gctgatgatg taattaaagt 4320 
tgaggtgtgg caggcttgag atccaagatg 4380 
aaccccgtgt atccatatga cacggaaacc 4440 
ccctttgtat cccccaatgg gttccaagag 4500 
tccgaacctc tagttacctc caatggcatg 4560 
ctggacgagg ccggcaacct tacctcccaa 4620 
aaaaccaagt caaacataaa cctggaaata 4680 
ctaactgtgg ctgccgccgc acctccaatg 4740 
caggccccgc taaccgtgca cgactccaaa 4800 
gtgtcagaag gaaagctagc cctgcaaaca 4860 
acccttacta tcactgcctc accccctcta 4920 
ttgaaagagc ccatt-ta-tac acaaaatgga 4980 
catgtaacag acgacctaaa cactttgacc SO 40 
aatacttcct tgcaaactaa agttactgga 5100 
caacttaatg tagcaggagg actaaggatc 5150 
gttagttatc cgtfctgatgc t-caaaaccaa 5220 
tttataaact cagcccacaa cttggatatt S280 
gcttcaaaca attccaaaaa gcttgaggtt 5340 
gacgctacag ccatagccat taatgcagga 5400 
ccaaacacaa atcccctcaa aacaaaaatt 5460 
gctatggttc ctaaactagg aaccggcctt 5520 
ggaaacaaaa ataatgataa gctaactttg 5580 
agactaaatg cagagaaaga tgctaaactc 5640 
atacttgcta cagtttcagt tittggccgtt 5700 
gttcaaagtg ctcatcttac tataagattt 5760 
ttcctggacc cagaatattg gaactttaga 5820 
acaaacgctg ttggatttat gcctaaccta 5880 
gccaaaagca acafctgtcag -tcaagtttac 5940 
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ttaaacggag acaaaactaa acctgtaaca ctaaccatta cactaaacgg tacacaggaa 6000 
acaggagaca caactccaag tgcatactct atgtcatttt catgggactg gtctggecac 6060 
aactacatta atgaaatatt tgccacatcc tcttacactt tttcatacat tgcccaagaa 6120 
taaagaagcg gccgctcgag catgcatcta gagggcccta ttctatagtg tcacctaaat 6180 
gctagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 6240 
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct -ttcctaataa ^300 
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg -63*60 
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ge a ^gcggtg -6420 
ggctctatgg cttctgaggc ggaaagaacc agctggggct ctagggggta tccccacgcg S480 
ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca '6540 
cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc 6€00 
gccggctttc cccgtcaagc tctaaatcgg ggcatccctt tagggttccg atttagtgct 6660 
ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg 6720 
ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggacfcc 6780 
ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg *840 
attttgggga tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg 6900 
aattaattct gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccaggca 6960 
ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ccaggtgtgg aaagt<xx:ca 7020 
ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc 7080 
ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 7140 
catggctgac taattttttt tatttatgca gaggccgagg ccgcctctgc ctctgagcta 7200 
ttccagaagt agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcccggga 7260 
gcttgtatat ccattttcgg atctgatcaa gagacaggat gaggatcgtt tcgcatgatt 7320 
gaacaagatg gattgcacgc aggttctccg gccgcttggg tggagaggct atfccggctat 7380 
gactgggcac aacagacaat cggctgctct gatgccgccg tgttccggct gtcagcgcag 7440 
gggcgcccgg ttctttttgt caagaccgac ctgtccggtg ccctgaatga actgcaggac 7500 
gaggcagcgc ggctatcgtg gctggccacg acgggcgttc cttgcgcagc tgtgctcgac 7560 
gttgtcactg aagcgggaag ggactggctg ctattgggcg aagtgccggg gcaggafcctc 7620 
ctgtcatctc accttgctcc tgccgagaaa gtatccatca tggctgatgc aatgcggcgg 7680 
ctgcatacgc ttgatccggc tacctgccca ttcgaccacc aagcgaaaca -tcgcatcgag 7740 
cgagcacgta ctcggatgga agccggtctt gtcgatcagg atgatctgga -cgaagagcat 7800 
caggggctcg cgccagccga actgttcgcc aggctcaagg cgcgcatgcc cgacggcgag 78€0 
gatctcgtcg tgacccatgg cgatgcctgc ttgccgaata tcatggtgga aaatggccgc 7920 
ttttctggat tcatcgactg tggccggctg ggtgtggcgg accgctatca ggacatagcg 7980 
ttggctaccc gtgatattgc tgaagagctt ggcggcgaat gggctgaccg -ctfccctcgtg 8040 
ctttacggta tcgccgctcc cgattcgcag cgcatcgcct tctatcgcct tcttgacgag 8100 
ttcttctgag cgggactctg gggttcgaaa tgaccgacca agcgacgccc aacctgccat 81€0 
cacgagattt cgattccacc gccgccttct atgaaaggtt gggctfccgga afccgttttcc 8220 
gggacgccgg ctggatgatc ctccagcgcg gggatctcat gctggagtt*: ttcgcccacc 8280 
ccaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca 8340 
caaataaagc atttttttca ctgcattcta gttgtggttt gfcccaaactc atcaatgtat 8400 
cttatcatgt ctgtataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 84€0 
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 8520 
taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 8580 
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 8640 
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgaetcgc 8700 
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 8760 
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 8820 
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc -ccccctgacg 8880 
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 8940 
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc -ctgecgetta 9000 
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttcfccaa tgcfccacgct 9060 
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 9120 
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 9180 
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 9240 
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 9300 
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagetctt 93*60 
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 9420 
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg ^ctgacgctc 9460 
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatct^ca 9S40 
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 9600 
cttggtctga cagttaccaa tgcttaatca gtgaggcacc 4;a-fcctcagcg atctgtctat 9660 
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ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



9720 

9780 

9840 

99O0 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10S00 

10S60 

10610 



<210> 17 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

tgtacaccgg atccggcgca cacc 24 



<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

cacaacgagc tcaattaatt aattgccaca tcctc 35 

<210> 19 

<211> 4 

<212> PRT 

<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 

<210> 20 

<211> 12 

<212> PRT 

<213> adenovirus 

<400> 20 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro <31y Ser 
15 10 

<210> 22 
<211> 327 
<212> DNA 
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<213> adenovirus 
<400> 22 

agatctgaat tcgagctcgc tgttgggctc gcggttgagg acaaactctt cgcggtcttt 60 
ccagtactct tggatcggaa acccgtcggc ctccgaacgg tactccgcca t:cgagggacc 120 
tgagcgagtc cgcatcgacc ggatcggaaa acctctcgag aaaggcgtct aaccagfccac 180 
agtcgcaagg taggctgagc accgtggcgg gcggcagcgg gtggcggtcg gggttgtttc 240 
tggcggaggt gctgctgatg atgtaattaa agtaggcggt cttgagacgg <:ggatggfccg 300 
aggtgaggtg tggcaggctt gagatct 327 

<210> 23 
<211> 32480 
<212> DNA 
<213> adenovirus 

<400> 23 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt €0. 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tctctagcat cgatgtcgac 360 
aagcttgaat tcgattaatg tgagttagct cactcattag gcaccccagg ctttacactt 420 
tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acaeaggaaa 480 
cagctatgac catgattacg aattcggcgc agcaccatgg cctgaaataa octctgaaag S40 
aggaacttgg ttaggtacct tctgaggcgg aaagaaccag ^ctgtggaatg tgtgfccagtt 600 
agggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa €60 
ttagtcagca accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag 720 
catgcatctc aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct 780 
aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt fctatttatgc 840 
agaggccgag gccgcctcgg cctctgagct attccagaag 4:agtgaggag gcttttttgg 900 
aggcctaggc ttttgcaaaa agcttgggat ctctataatc tcgcgcaacc tattttcccc 960 
tcgaacactt tttaagccgt agataaacag gctgggacac ttcacatgag -cgaaaaatac 1020 
atcgtcacct gggacatgtt gcagatccat gcacgtaaac tcgcaagccg actgatgect 1080 
tctgaacaat ggaaaggcat tattgccgta agccgtggcg gtctggtacc ggtgggtgaa 1140 
gaccagaaac agcacctcga actgagccgc gatattgccc agcgtttcaa cgcgctgtat 1200 
ggcgagatcg atcccgtcgt tttacaacgt cgtgactggg aaaaccctgg .cgttacccaa 1260 
cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 1320 
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgctt tgcctggttt 1380 
ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg atcttcctga ggccgatact 1440 
gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg cgcccatcta caccaacgta 1500 
acctatccca ttacggtcaa tccgccgttt gttcccacgg agaatccgac gggttgttac 1S60 
tcgctcacat ttaatgttga tgaaagctgg ctacaggaag gccagacgcg aattattttt 1*620 
gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc gctgggtcgg ttacggccag 1*680 
gacagtcgtt tgccgtctga atttgacctg agcgcatttt tacgcgccgg agaaaaccgc 1740 
ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc tggaagatca ggafcatgtgg 1800 
cggatgagcg gcattttccg tgacgtctcg ttgctgcata aaccgactac acaaatcagc 18€0 
gatttccatg ttgccactcg ctttaatgat gatttcagcc gcgctgtact ggaggctgaa 1920 
gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa cagtttcttt atggcagggt 1980 
gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg aaattatcga tgagcgtggt 204O 
ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa acccgaaact gtggagcgcc 2100 
gaaatcccga atctctatcg tgcggtggtt gaactgcaca ccgccgacgg cacgctgatt 2160 
gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga ttgaaaatgg tctgctgetg 2220 
ctgaacggca agccgttgct gattcgaggc gttaaccgtc acgagcabca tcctctgcat 2280 
ggtcaggtca tggatgagca gacgatggtg caggatatcc tgctgatgaa gcagaacaac 2340 
tttaacgccg tgcgctgttc gcattatccg aaccatccgc *gtggtacac gctgtgcgac 2400 
cgctacggcc tgtatgtggt ggatgaagcc aatattgaaa cccacggcat ggtgccaatg 2460 
aatcgtctga ccgatgatcc gcgctggcta ccggcgatga gcgaacgcgt aacgcgaatg 2520 
gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt cgctggggaa tgaatcaggc 2580 
cacggcgcta atcacgacgc gctgtatcgc tggatcaaat ctgt<:gatcc ttcccgcccg 2640 
gtgcagtatg aaggcggcgg agccgacacc acggccaccg atattatttg <:ccgatgtac 2700 
gcgcgcgtgg atgaagacca gcccttcccg gctgtgccga aatggtccat caaaaaatgg 27€0 
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ctttcgctac ctggagagac gcgcccgctg atcctttgcg aatacgccca cgcgatgggt 2€20 
aacagtcttg gcggtttcgc taaatactgg caggcgtttc gtcagtatcc ccgtttacag 2880 
ggcggcttcg tctgggactg ggtggatcag tcgctgatta aatatgatga aaacggcaac 2940 
ccgtggtcgg cttacggcgg tgattttggc gatacgccga acgat<:gcca gttctgtatg 3000 
aacggtctgg tctttgccga ccgcacgccg catccagcgc tgacggaagc aaaacaccag 306 0 
cagcagtttt tccagttccg tttatccggg caaaccatcg aagtgaccag cgaatacctg 3120 
ttccgtcata gcgataacga gctcctgcac tggatggtgg cgctggatgg taagccgctg 3180 
gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta aacagttgat tgaactgcct 3240 
gaactaccgc agccggagag cgccgggcaa ctctggctca cagtacgcgt agtgcaaccg 3300 
aacgcgaccg catggtcaga agccgggcac atcagcgcct ggcagcagtg gcgtctggcg 3360 
gaaaacctca gtgtgacgct ccccgccgcg tcccacgoca tcccgcafcct gaccaccagc 3420 
gaaatggatt tttgcatcga gctgggtaat aagcgttggc aatttaaccg ccagtcaggc 3480 
tttctttcac agatgtggat tggcgataaa aaacaactgc tgacgccgct gcgcgatcag 3S40 
ttcacccgtg caccgctgga taacgacatt ggcgtaagtg aagcgacccg -cattgaccct 3600 
aacgcctggg tcgaacgctg gaaggcggcg ggccattacc aggccgaagc agcgttgttg 3660 
cagtgcacgg cagatacact tgctgatgcg gtgctgatta cgaccgcfcca <:gcgtggcag 3720 
catcagggga aaaccttatt tatcagccgg aaaacctacc ggattgatgg tagtggtcaa 3780 
atggcgatta ccgttgatgt tgaagtggcg agcgatacac cgcatccggc gcggattggc 3840 
ctgaactgcc agctggcgca ggtagcagag cgggtaaact ggctcggatt agggccgcaa 3900 
gaaaactatc ccgaccgcct tactgccgcc tgttttgacc gctgggatct gccattgtca 3960 
gacatgtata ccccgtacgt cttcccgagc gaaaacggtc tgcgctgcgg gacgcgcgaa 4020 
ttgaattatg gcccacacca gtggcgcggc gacttccagt tcaacatcag ccgctacagt 4080 
caacagcaac tgatggaaac cagccatcgc catctgctgc acgcggaaga aggcacatgg 4140 
ctgaatatcg acggtttcca tatggggatt ggtggcgacg actcctggag cccgtcagta 4200 
tcggcggaat tccagctgag cgccggtcgc taccattacc agttggtctg gtgtcaaaaa 42*60 
taataataac cgggcaggcc atgtctgccc gtatttcgcg taaggaaatc cattatgtac 4320 
tatttaaaaa acacaaactt ttggatgttc ggtttattct tttfcctttta ctttt^fctatc 4380 
atgggagcct acttcccgtt tttcccgatt tggctacatg acatcaacca -tatcagcaaa 4440 
agtgatacgg gtattatttt tgccgctatt tctctgttct cgctattatt ccaaccgetg 4500 
tttggtctgc tttctgacaa actcggaact tgtttattgc agcttataat ggttacaaat 4560 
aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg 4620 
gtttgtccaa actcatcaat gtatcttatc atgtctggat ccagatctgg gcgtggctta 4680 
agggtgggaa agaatatata aggtgggggt cttatgtagt tttgtatctg ttttgcagca 4740 
gccgccgccg ccatgagcac caactcgttt gatggaagca ttgtgagctc atatttgaca 4800 
acgcgcatgc ccccatgggc cggggtgcgt cagaatgtga tgggctccag cattgatggt 4860 
cgccccgtcc tgcccgcaaa ctctactacc ttgacctacg agaccgtgtc tggaacgccg 4920 
ttggagactg cagcctccgc cgccgcttca gccgctgcag ccaccgcccg ogggattgtg 4980 
actgactttg ctttcctgag cccgcttgca agcagtgcag cttcccgttc atccgcecgc 5040 
gatgacaagt tgacggctct tttggcacaa ttggattctt tgacccggga acttaatgtc 5100 
gtttctcagc agctgttgga tctgcgccag caggtttctg ccctgaaggc titcctcccct 5160 
cccaatgcgg tttaaaacat aaataaaaaa ccagactctg tttggatttg gatcaagcaa 5220 
gtgtcttgct gtctttattt aggggttttg cgcgcgcggt aggcccggga ccagcggtct 5280 
cggtcgttga gggtcctgtg tattttttcc aggacgtggt aaaggtgact ctggatgttc 5340 
agatacatgg gcataagccc gtctctgggg tggaggtagc accactgcag agctt:<:atgc 5400 
tgcggggtgg tgttgtagat gatccagtcg tagcaggagc gctgggcgtg gtgcctaaaa 5460 
atgtctttca gtagcaagct gattgccagg ggcaggccct tggtgtaagt gtttacaaag 5520 
cggttaagct gggatgggtg catacgtggg gatatgagat gcatcttgga ctgtattttt 5580 
aggttggcta tgttcccagc catatccctc cggggattca tgttgtgcag aaccaecagc 5640 
acagtgtatc cggtgcactt gggaaatttg tcatgtagct tagaaggaaa tgcgtggaag 5700 
aacttggaga cgcccttgtg acctccaaga ttttccatgc attcgtccat aatgatggca 5760 
atgggcccac gggcggcggc ctgggcgaag atatttctgg gatcactaac gtcatagttg 5820 
tgttccagga tgagatcgtc ataggccatt tttacaaagc gcgggcggag ggtgccagac 5880 
tgcggtataa tggttccatc cggcccaggg gcgtagttac cctcacagat ttgcatttcc 5940 
cacgctttga gttcagatgg ggggatcatg tctacctgcg gggcgatgaa gaaaacggtt 6000 
tccggggtag gggagatcag ctgggaagaa agcaggttcc tgagcagctg <:gacttaccg 6060 
cagccggtgg gcccgtaaat cacacctatt accgggtgca actggtagtt aagagagctg 6120 
cagctgccgt catccctgag caggggggcc acttcgttaa gcatgtccct gactcgcatg 6180 
ttttccctga ccaaatccgc cagaaggcgc tcgccgccca gcgatagcag ttctt;gcaag 6240 
gaagcaaagt ttttcaacgg tttgagaccg tccgccgtag gcatgctttt; gagcgtttga 6300 
ccaagcagtt ccaggcggtc ccacagctcg gtcacctgct ctacggcatc t-cgatocagc 6360 
atatctcctc gtttcgcggg ttggggcggc tttcgctgta cggcagtagt <:ggtgctcgt -6420 
ccagacgggc cagggtcatg tctttccacg ggcgcagggt cctcgtcagc gtagtctggg 6480 
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tcacggtgaa ggggtgcgct ccgggctgcg cgctggccag ggtgcgcttg aggctggtcc 6540 
tgctggtgct gaagcgctgc cggtcttcgc cctgcgcgtc ggccaggtag catttgaeca 6600 
tggtgtcata gtccagcccc tccgcggcgt ggcccttggc gcgcagcttg cccttggagg 6660 
aggcgccgca cgaggggcag tgcagacttt tgagggcgta gagcttgggc gcgagaaata 6720 
ccgattccgg ggagtaggca tccgcgccgc aggccccgca gacggtctcg cattccacga €780 
gccaggtgag ctctggccgt tcggggtcaa aaaccaggtt tcccccatgc tttttgatgc 6840 
gtttcttacc tctggtttcc atgagccggt gtccacgctc ggtgacgaaa aggctgtccg 6900 
tgtccccgta tacagacttg agaggcctgt cctcgagcgg tgttccgcgg *cctcctcgt 6960 
atagaaactc ggaccactct gagacaaagg ctcgcgtcca ggccagcacg aaggaggcta 7020 
agtgggaggg gtagcggtcg ttgtccacta gggggtccac tcgctccagg gtgtgaagac 7080 
acatgtcgcc ctcttcggca tcaaggaagg tgattggttt gtaggtgtag gccacgtgac 7140 
cgggtgttcc tgaagggggg ctataaaagg gggtgggggc gcgttcgtcc tcactctctt 7200 
ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta ctccctctga aaagcgggca 7260 
tgacttctgc gctaagattg tcagtttcca aaaacgagga ggatttgata tfccacctggc 7320 
ccgcggtgat gcctttgagg gtggccgcat ccatctggtc agaaaagaca atctttttgt 7380 
tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga cagcaacttg gcgatggagc 7440 
gcagggtttg gtttttgtcg cgatcggcgc gctccttggc cgcgatgttt agctgcacgt 7S00 
attcgcgcgc aacgcaccgc cattcgggaa agacggtggt gcgcfccgtcg ggcaccaggt 7560 
gcacgcgcca accgcggttg tgcagggtga caaggtcaac gctggtggct acetctccgc 7620 
gtaggcgctc gttggtccag cagaggcggc cgcccttgcg cgagcagaat ggcggtaggg 7680 
ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt aaagaccccg ggcagcaggc 7740 
gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag cgcctgctgc catgcgcggg 7800 
cggcaagcgc gcgctcgtat gggttgagtg ggggacccca tggcatgggg tgggtgagcg 7860 
cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg ctctctgagt at^ocaagat 7920 
atgtagggta gcatcttcca ccgcggatgc tggcgcgcac gtaatcgtat agttcgtgcg 7980 
agggagcgag gaggtcggga ccgaggttgc tacgggcggg ctgctctgct <:ggaagaota 8040 
tctgcctgaa gatggcatgt gagttggatg atatggttgg acgctggaag acgttgaagc 8100 
tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc gtaggagtcg -cgcagcttgt 6160 
tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta gtccagggtt tccttgatga 8220 
tgtcatactt atcctgtccc ttttttttcc acagctcgcg gttgaggaca aactcttcgc 8280 
ggtctttcca gtactcttgg atcggaaacc cgtcggcctc <:gaacggtaa gagcctagca 8340 
tgtagaactg gttgacggcc tggtaggcgc agcatccctt ttctacgggt agcgcgtatg 8400 
cctgcgcggc cttccggagc gaggtgtggg tgagcgcaaa ggtgtccctg aocatgactt 8460 
tgaggtactg gtatttgaag tcagtgtcgt cgcatccgcc ctgctcccag agcaaaaagt 8S20 
ccgtgcgctt tttggaacgc ggatttggca gggcgaaggt gacatcgttg aagagtatct 8S80 
ttcccgcgcg aggcataaag ttgcgtgtga tgcggaaggg -tcccggcacc tcggaacggt 8640 
tgttaattac ctgggcggcg agcacgatct cgtcaaagcc gttgatgfctg -tggcccacaa 8700 
tgtaaagttc caagaagcgc gggatgccct tgatggaagg caatttttta agfctcctcgt 8760 
aggtgagctc ttcaggggag ctgagcccgt gctctgaaag ggcccagtct gcaagatgag 8820 
ggttggaagc gacgaatgag ctccacaggt cacgggccat tagcatttgc aggtggtcgc 8880 
gaaaggtcct aaactggcga cctatggcca ttttttctgg ggtgatgcag tagaaggtaa 8940 
gcgggtcttg ttcccagcgg tcccatccaa ggttcgcggc taggtctcgc gcggcagtca 9000 
ctagaggctc atctccgccg aacttcatga ccagcatgaa gggcacgagc tgottcccaa 9060 
aggcccccat ccaagtatag gtctctacat cgtaggtgac aaagagacgc tcggtgcgag 9120 
gatgcgagcc gatcgggaag aactggatct cccgccacca attggaggag tggctatfcga 9180 
tgtggtgaaa gtagaagtcc ctgcgacggg ccgaacactc gtgctggctt ttgtaaaaac 9240 
gtgcgcagta ctggcagcgg tgcacgggct gtacatcctg cacgaggttg acctgacgac 9300 
cgcgcacaag gaagcagagt gggaatttga gcccctcgcc tggcgggttt ggctggtggt 9360 
cttctacttc ggctgcttgt ccttgaccgt ctggctgctc gaggggagtt acggtggatc 9420 
ggaccaccac gccgcgcgag cccaaagtcc agatgtccgc gcgcggcggt cggagcttga 9480 
tgacaacatc gcgcagatgg gagctgtcca tggtctggag ctcccgcggc gtcaggfccag 9S40 
gcgggagctc ctgcaggttt acctcgcata gacgggtcag ggcgcgggct agatccaggt 9600 
gatacctaat ttccaggggc tggttggtgg cggcgtcgat ggcttgcaag aggccgcafcc 9660 
cccgcggcgc gactacggta ccgcgcggcg ggcggtgggc cgcgggggtg tccttggatg 9720 
atgcatctaa aagcggtgac gcgggcgagc ccccggaggt agggggggct ccggacccgc 9780 
cgggagaggg ggcaggggca cgtcggcgcc gcgcgcgggc aggagctggt gctgcgcgcg 9840 
taggttgctg gcgaacgcga cgacgcggcg gttgatctcc tgaatctggc gcctctgcgt 9900 
gaagacgacg ggcccggtga gcttgagcct gaaagagagt tcgacagaat *caatttcggt 9960 
gtcgttgacg gcggcctggc gcaaaatctc ctgcacgtct cctgagttgt ct-tgataggc 10020 
gatctcggcc atgaactgct cgatctcttc ctcctggaga tctccgcgtc cggctcgctc 1O080 
cacggtggcg gcgaggtcgt tggaaatgcg ggccatgagc tgcgagaagg <:gttgaggec 10140 
tccctcgttc cagacgcggc tgtagaccac gccccct-tcg gcatcgcggg -cgcgcatgac 1O200 
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cacctgcgcg agattgagct ccacgtgccg ggcgaagacg gcgtagtttc gcaggcgctg 10260 
aaagaggtag ttgagggtgg tggcggtgtg ttctgccacg aagaagtaca taacccagcg 10320 
tcgcaacgtg gattcgttga tatcccccaa ggcctcaagg cgctocatgg cctcgtagaa 10380 
gtccacggcg aagttgaaaa actgggagtt gcgcgccgac acggttaact cctcctccag 10440 
aagacggatg agctcggcga cagtgtcgcg cacctcgcgc tcaaaggcta caggggcctc 10500 
ttcttcttct tcaatctcct cttccataag ggcctcccct tcttcttctt ctggcggcgg 105^0 
tgggggaggg gggacacggc ggcgacgacg gcgcaccggg aggcggtcga caaagcgctc 10620 
gatcatctcc ccgcggcgac ggcgcatggt ctcggtgacg gcgcggccgt tctcgcgggg 10680 
gcgcagttgg aagacgccgc ccgtcatgtc ccggttatgg gttggcgggg ggctgccatg 10740 
cggcagggat acggcgctaa cgatgcatct caacaattgt tgtgtaggta ctccgccgcc 10800 
gagggacctg agcgagtccg catcgaccgg atcggaaaac ctctcgagaa aggcgtxrtaa 10860 
ccagtcacag tcgcaaggta ggctgagcac cgtggcgggc ggcagcgggc ggcggfccggg 10920 
gttgtttctg gcggaggtgc tgctgatgat gtaattaaag taggcggtct tgagacggcg 10980 
gatggtcgac agaagcacca tgtccttggg tccggcctgc tgaatgcgca ggcggtcggc 11040 
catgccccag gcttcgtttt gacatcggcg caggtctttg tagtagtctt gcatgagect 11100 
ttctaccggc acttcttctt ctccttcctc ttgtcctgca tctcttgcat ctatcgctgc 11160 
ggcggcggcg gagtttggcc gtaggtggcg ccctcttcct cccatgcgtg tgaccccgaa 11220 
gcccctcatc ggctgaagca gggctaggtc ggcgacaacg cgctcggcta atatggcctg 11280 
ctgcacctgc gtgagggtag actggaagtc atccatgtcc acaaagcggt ggtatgcgcc 11340 
cgtgttgatg gtgtaagtgc agttggccat aacggaccag ttaacggtct ggtgacecgg 11400 
ctgcgagagc tcggtgtacc tgagacgcga gtaagccctc gagtcaaata cgtagtcgtt 114S0 
gcaagtccgc accaggtact ggtatcccac caaaaagfcgc ggcggcggct ggcggtagag 11520 
gggccagcgt agggtggccg gggctccggg ggcgagatct tccaacataa ggcgatgata 11580 
tccgtagatg tacctggaca tccaggtgat gccggcggcg gtggtggagg -cgcgcggaaa 11-640 
gtcgcggacg cggttccaga tgttgcgcag cggcaaaaag tgctccatgg tcgggacgct 11700 
ctggccggtc aggcgcgcgc aatcgttgac gctctagacc gtgcaaaagg agagcctgta 11760 
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg <:ggacgaecg 11820 
gggttcgagc cccgtatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 11880 
aacccaggtg tgcgacgtca gacaacgggg gagtgctcct tttggcttcc t-tccaggcgc 11940 
ggcggctgct gcgctagctt ttttggccac tggccgcgcg cagcgtaagc ggttaggctg 12000 
gaaagcgaaa gcattaagtg gctcgctccc tgtagccgga gggttatttt -ccaagggttg 12060 
agtcgcggga cccccggttc gagtctcgga -ccggccggac tgcggcgaac gggggtttgc 12120 
ctccccgtca tgcaagaccc cgcttgcaaa ttcctccgga aacagggacg agcocctttt 12180 
ttgcttttcc cagatgcatc cggtgctgcg gcagatgcgc ccccctcctc agcagcggca 12240 
agagcaagag cagcggcaga catgcagggc accctcccct cctcctaccg .cgtcaggagg 12300 
ggcgacatcc gcggttgacg cggcagcaga tggtgattac gaacccccgc ggcgccgggc 12360 
ccggcactac ctggacttgg aggagggcga gggcctggcg cggctaggag <:gccctctcc 12420 
tgagcggtac ccaagggtgc agctgaagcg tgatacgcgt gaggcgtacg fcgccgeggca 12480 
gaacctgttt cgcgaccgcg agggagagga gcccgaggag atgcgggatc gaaagttcca 12540 
cgcagggcgc gagctgcggc atggcctgaa tcgcgagcgg ttgctgcgcg aggaggactt 12600 
tgagcccgac gcgcgaaccg ggattagtcc cgcgcgcgca <:acgtggcgg ccgccgacct 12660 
ggtaaccgca tacgagcaga cggtgaacca ggagattaac tttcaaaaaa gctfctaacaa 12720 
ccacgtgcgt acgcttgtgg cgcgcgagga ggtggctata ggactgatgc atctgtggga 12780 
ctttgtaagc gcgctggagc aaaacccaaa tagcaagccg ctcatggcgc agctgttcct 12840 
tatagtgcag cacagcaggg acaacgaggc attcagggat gcgctgctaa acatagtaga 12900 
gcccgagggc cgctggctgc tcgatttgat aaacatcctg cagagcatag -tggtgcagga 12960 
gcgcagcttg agcctggctg acaaggtggc cgccatcaac tattccatgc t<:agcctggg 13020 
caagttttac gcccgcaaga tataccatac cccttacgtt cccatagaca aggaggtaaa 13080 
gatcgagggg ttctacatgc gcatggcgct gaaggtgctt accttgagcg acgacctggg 13140 
cgtttatcgc aacgagcgca tccacaaggc cgtgagcgtg agccggcggc gcgagcfccag 13200 
cgaccgcgag ctgatgcaca gcctgcaaag ggccctggct ggcacgggca gcggcgatag 13260 
agaggccgag tcctactttg acgcgggcgc tgacctgcgc tgggccccaa gccgacgcgc 13320 
cctggaggca gctggggccg gacctgggct ggcggtggca cccgcgcgcg ctggcaacgt 13380 
cggcggcgtg gaggaatatg acgaggacga tgagtacgag ccagaggacg gcgagtacta 13440 
agcggtgatg tttctgatca gatgatgcaa gacgcaacgg acccggcggt gcgggcggcg 13500 
ctgcagagcc agccgtccgg ccttaactcc acggacgact ggcgccaggt catggaecgc 13560 
atcatgtcgc tgactgcgcg caatcctgac gcgttccggc agcagccgca ggccaaecgg 13620 
ctctccgcaa ttctggaagc ggtggtcccg gcgcgcgcaa accccacgca cgagaaggtg 13680 
ctggcgatcgf taaacgcgct ggccgaaaac agggccatcc ggcccgacga .ggccggcctg 13740 
gtctacgacg cgctgcttca gcgcgtggct cgttacaaca gcggcaacgt gcagaocaac 13600 
ctggaccggc tggtggggga tgtgcgcgag gccgtggcgc agcgtgagcg cgcgcagcag 13860 
cagggcaacc tgggctccat ggttgcacta aacgccttocc tgagtacaca -gcccgecaac 13920 
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gtgccgcggg gacaggagga ctacaccaac tttgtgagcg cactgcggct aatggtgact 13980 
gagacaccgc aaagtgaggt gtaccagtct gggccagact attttttoca gaccagtaga 14040 
caaggcctgc agaccgtaaa cctgagccag gctttcaaaa acttgcaggg gctgtggggg 14100 
gtgcgggctc ccacaggcga ccgcgcgacc gtgtctagct tgctgacgcc caacfccgcgc 14160 
ctgttgctgc tgctaatagc gcccttcacg gacagtggca gcgtgtcccg ggacacatac 14220 
ctaggtcact tgctgacact gtaccgcgag gccataggtc aggcgcatgt ggacgagcat 14280 
actttccagg agattacaag tgtcagccgc gcgctggggc aggaggacac gggcagectg 14340 
gaggcaaccc taaactacct gctgaccaac cggcggcaga agatcccctc gttgcacagt 14400 
ttaaacagcg aggaggagcg cattttgcgc tacgtgcagc agagcgtgag ccttaacctg 144-60 
atgcgcgacg gggtaacgcc cagcgtggcg ctggacatga ccgcgcgcaa catggaaccg 14520 
ggcatgtatg cctcaaaccg gccgtttatc aaccgcctaa tggactactt gcatcgcgcg 14S80 
gccgccgtga accccgagta tttcaccaat gccafccttga acccgcactg gctaccgccc 14640 
cctggtttct acaccggggg attcgaggtg cccgagggta acgatggatt <:ctctgggac 14700 
gacatagacg acagcgtgtt ttccccgcaa ccgcagaccc tgctagagtt gcaacagcgc 14760 
gagcaggcag aggcggcgct gcgaaaggaa agcttccgca ggccaagcag cttgtccgat 14820 
ctaggcgctg cggccccgcg gtcagatgct agtagcccat ttccaagctt gatagggtct 14880 
cttaccagca ctcgcaccac ccgcccgcgc ctgctgggcg aggaggagta <:ctaaacaac 14940 
tcgctgctgc agccgcagcg cgaaaaaaac ctgcctccgg <:atttcccaa caacgggata 1S000 
gagagcctag tggacaagat gagtagatgg aagacgtacg -cgcaggagca cagggacgtg 15060 
ccaggcccgc gcccgcccac ccgtcgtcaa aggcacgacc gtcagcgggg fc-ctggtgtgg 15120 
gaggacgatg actcggcaga cgacagcagc gtcctggatt tgggagggag tggcaacccg 15180 
tttgcgcacc ttcgccccag gctggggaga atgttttaaa aaaaaaaaag catgatgcaa 15240 
aataaaaaac tcaccaaggc catggcaccg agcgttggtt ttcttgtatt ccccttagta 15300 
tgcggcgcgc ggcgatgtat gaggaaggtc ctcctccctc ctacgagagt gtggtgagcg 1S360 
cggcgccagt ggcggcggcg ctgggttctc ccttcgafcgc tcccctggac ccgccgtttg 1S420 
tgcctccgcg gtacctgcgg cctaccgggg ggagaaacag catccgttac tctgagttgg 15480 
cacccctatt cgacaccacc cgtgtgtacc tggtggacaa caagtcaacg gatgtggcat 15540 
ccctgaacta ccagaacgac cacagcaact ttctgaccac ggtcattcaa aacaatgact 15600 
acagcccggg ggaggcaagc acacagacca tcaatcttga -cgaccggtcg cactggggcg 15660 
gcgacctgaa aaccatcctg cataccaaca tgccaaatgt gaacgagttc atgtttacca 15720 
ataagtttaa ggcgcgggtg atggtgtcgc gcttgcctac taaggacaat caggtggagc 15780 
tgaaatacga gtgggtggag ttcacgctgc ccgagggcaa ctactccgag accatgacca 15840 
tagaccttat gaacaacgcg atcgtggagc actacttgaa agtgggcaga cagaacgggg 15900 
ttctggaaag cgacatcggg gtaaagtttg acacccgcaa cttcagactg gggtttgacc 15960 
ccgtcactgg tcttgtcatg cctggggtat atacaaacga agccttocat ccagacatca 16020 
ttttgctgcc aggatgcggg gtggacttca cccacagccg -cctgagcaac ttgttgggca 16080 
tccgcaagcg gcaacccttc caggagggct ttaggatcac ctacgatgat ctggagggtg 16140 
gtaacattcc cgcactgttg gatgtggacg cctaccaggc gagcttgaaa gatgacaccg 16200 
aacagggcgg gggtggcgca ggcggcagca acagcagtgg cagcggcgcg gaagagaact 16260 
ccaacgcggc agccgcggca atgcagccgg tggaggacat gaacgatcat gccatlx:gcg 16320 
gcgacacctt tgccacacgg gctgaggaga agcgcgctga ggccgaagca gcggccgaag 16380 
ctgccgcccc cgctgcgcaa cccgaggtcg agaagcctca gaagaaaccg gtgatcaaac 16440 
ccctgacaga ggacagcaag aaacgcagtt acaacctaat aagcaatgac agcaccttca 16500 
cccagtaccg cagctggtac cttgcataca actacggcga ccctcagacc ggaatccgct 16560 
catggaccct gctttgcact cctgacgtaa cctgcggctc ggagcaggtc tactggt-cgt 16620 
tgccagacat gatgcaagac cccgtgacct tccgctccac gcgccagato agcaactttc 16680 
cggtggtggg cgccgagctg ttgcccgtgc actccaagag cttctacaac gaocaggccg 16740 
tctactccca actcatccgc cagtttacct ctctgaccca cgtgttcaat cgctttcccg 16800 
agaaccagat tttggcgcgc ccgccagccc ccaccafecac caccgtcagt gaaaacgttc 16860 
ctgctctcac agatcacggg acgctaccgc tgcgcaacag catcggagga gtccagcgag 16920 
tgaccattac tgacgccaga cgccgcacct gcccctacgt ttacaaggcc <:tgggcatag 16980 
tctcgccgcg cgtcctatcg agccgcactt tttgagcaag catgtocafcc cttatatcgc 17040 
ccagcaataa cacaggctgg ggcctgcgct tcccaagcaa gatgtttggc ggggccaaga 17100 
agcgctccga ccaacaccca gtgcgcgtgc gcgggcacta ccgcgcgccc tggggcgcgc 17160 
acaaacgcgg ccgcactggg cgcaccaccg tcgatgacgc catcgacgcg gtggtggagg 17220 
aggcgcgcaa ctacacgccc acgccgccac cagtgtccac agtggacgcg gccattcaga 17280 
ccgtggtgcg cggagcccgg cgctatgcta aaatgaagag acggcggagg <:gcgtagcac 17340 
gtcgccaccg ccgccgaccc ggcactgccg rccaacgcgc ggcggcggcc ctgcttaacc 17400 
gcgcacgtcg caccggccga cgggcggcca tgcgggccgc tcgaaggctg gccgcgggta 17460 
ttgtcactgt gccccccagg tccaggcgac gagcggccgc -cgcagcagcc gcggecatta 17520 
gtgctatgac tcagggtcgc aggggcaacg tgtattgggt gcgcgacfccg gttagcggcc 17580 
tgcgcgtgcc cgtgcgcacc cgccccccgc gcaactagat -tgcaagaaaa aactacttag 17640 
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actcgtactg ttgtatgtat ccagcggcgg cggcgcgcaa cgaagctatg tccaagcgca 17700 
aaatcaaaga agagatgctc caggtcatcg cgccggagat ctatggcccc -ccgaagaagg 17760 
aagagcagga ttacaagccc cgaaagctaa agcgggtcaa aaagaaaaag aaagatgatg 17820 
atgatgaact tgacgacgag gtggaactgc tgcacgctac cgcgcccagg cgacgggtac 17880 
agtggaaagg tcgacgcgta aaacgtgttt tgcgacccgg caccaccgta gtctttacgc 17940 
ccggtgagcg ctccacccgc acctacaagc gcgtgtatga tgaggtgtac ggcgacgagg 18000 
acctgcttga gcaggccaac gagcgcctcg gggagtttgc ctacggaaag -cggcataagg 18060 
acatgctggc gttgccgctg gacgagggca acccaacacc tagcctaaag cccgtaacac 18120 
tgcagcaggt gctgcccgcg cttgcaccgt ccgaagaaaa gcgcggccta aagcgcgagt 18180 
ctggtgactt ggcacccacc gtgcagctga tggtacccaa gcgccagcga <?tggaagatg 18240 
tcttggaaaa aatgaccgtg gaacctgggc tggagcccga ggtccgcgtg -cggccaatca 18300 
agcaggtggc gccgggactg ggcgtgcaga ccgtggacgt tcagataccc actaccagta 18360 
gcaccagtat tgccaccgcc acagagggca tggagacaca aacgtccccg gttgcctcag 18420 
cggtggcgga tgccgcggtg caggcggtcg ctgcggocgc gtccaagacc tctacggagg 18480 
tgcaaacgga cccgtggatg tttcgcgttt cagccccccg gcgcccgcgc ggttcgagga 18540 
agtacggcgc cgccagcgcg ctactgcccg aatatgccct acatccttcc attgcgccta 18600 
cccccggcta tcgtggctac acctaccgcc ccagaagacg agcaactacc cgacgccgaa .18660 
ccaccactgg aacccgccgc cgccgtcgcc gfccgccagcc cgtgctggcc ccgatttccg 18720 
tgcgcagggt ggctcgcgaa ggaggcagga ccctggtgct gccaacagcg cgctaecacc 18780 
ccagcatcgt ttaaaagccg gtctttgtgg ttcttgcaga tatggccctc acctgccgcc 18840 
tccgtttccc ggtgccggga ttccgaggaa gaatgcaccg taggaggggc atggccggcc 18900 
acggcctgac gggcggcatg cgtcgtgcgc accaccggcg gcggcgcgcg tcgcaccgtc 18960 
gcatgcgcgg cggtatcctg cccctcctta ttccactgat cgccgcggcg attggcgccg 19020 
tgcccggaat tgcatccgtg gccttgcagg cgcagagaca ctgattaaaa acaagttgca 19080 
tgtggaaaaa tcaaaataaa aagtctggac tct<:acgctc gcttggtcct gtaactattt 19140 
tgtagaatgg aagacatcaa ctttgcgtct ctggccccgc gacacggctc gcgcocgttc 19200 
atgggaaact ggcaagatat cggcaccagc aatatgagcg gtggcgcctt cagctggggc 19260 
tcgctgtgga gcggcattaa aaatttcggt tccaccgtta agaactatgg «cagcaaggcc 19320 
tggaacagca gcacaggcca gatgctgagg gataagttga aagagcaaaa tttccaacaa 19380 
aaggtggtag atggcctggc ctctggcatt agcggggtgg tggacctggc caaccaggca 19440 
gtgcaaaata agattaacag taagcttgat ccccgccctc ccgtagagga gcctccaccg 19500 
gccgtggaga cagtgtctcc agaggggcgt ggcgaaaagc gtccgcgccc cgacagggaa 19S60 
gaaactctgg tgacgcaaat agacgagcct ccctcgtacg aggaggcact aaagcaaggc 19620 
ctgcccacca cccgtcccat cgcgcccatg gctaccggag tgctgggcca gcacacaccc 19680 
gtaacgctgg acctgcctcc ccccgccgac acccagcaga aacctgtgct gccaggcccg 19740 
accgccgttg ttgtaacccg tcctagccgc gcgtccctgc gccgcgccgc -cagcggfcccg 19800 
cgatcgttgc ggcccgtagc cagtggcaac tggcaaagca cactgaacag catcgtgggt 19860 
ctgggggtgc aatccctgaa gcgccgacga tgcttctgaa tagctaacgt gtcgtatgtg 19920 
tgtcatgtat gcgtccatgt cgccgccaga ggagctgctg agccgccgcg *cgccogcttt 19980 
ccaagatggc taccccttcg atgatgccgc agtggtctta catgcacatc tcgggecagg 20040 
acgcctcgga gtacctgagc cccgggctgg tgcagtttgc ccgcgccacc gagacgtact 20100 
tcagcctgaa taacaagttt agaaacccca cggtggcgcc tacgcacgac gtgaccacag 20160 
accggtccca gcgtttgacg ctgcggttx:a tccctgtgga <:cgtgaggat actgcgtact 20220 
cgtacaaggc gcggttcacc ctagctgtgg gtgataaccg tgtgctggac atggcttcca 20280 
cgtactttga catccgcggc gtgctggaca ggggccctac ttttaagccc tact'ctggca 20340 
ctgcctacaa cgccctggct cccaagggtg ccccaaatcc fctgcgaatgg gatgaagctg 20400 
ctactgctct tgaaataaac ctagaagaag aggacgatga x:aacgaagac gaagtagacg 204€0 
agcaagctga gcagcaaaaa actcacgtat ttgggcaggc gccttattct ggtataaata 20520 
ttacaaagga gggtattcaa ataggtgtcg aaggtcaaac acctaaatat gccgataaaa 20580 
catttcaacc tgaacctcaa ataggagaat ctcagt^gta -cgaaactgaa attaatcatg 20640 
cagctgggag agtccttaaa aagactaccc caatgaaacc atgttacggt tcatatgcaa 20700 
aacccacaaa tgaaaatgga gggcaaggca ttcttgtaaa gcaacaaaat ggaaagctag 20760 
aaagtcaagt ggaaatgcaa tttttctcaa ctactgaggc gaccgcaggc aatggtgata 20820 
acttgactcc taaagtggta ttgtacagtg aagatgtaga tatagaaacc -ccagacactc 20880 
atatttctta catgcccact attaaggaag gtaactcacg agaactaatg ggccaacaat 20940 
ctatgcccaa caggcctaat tacattgctt ttagggacaa ttttattggt <:taatgtatt 21000 
acaacagcac gggtaatatg ggtgttctgg cgggccaagc atcgcagttg aatgctgttg 210S0 
tagatttgca agacagaaac acagagcttt cataccagct tttgcttgat tccattggtg 21120 
atagaaccag gtacttttct atgtggaatc aggctgt^ga cagctatgat <x:agatgtta 211-80 
gaattattga aaatcatgga actgaagatg aacttccaaa ttactgcttt -ccactgggag 21240 
gtgtgattaa tacagagact cttaccaagg taaaacctaa aacaggtcag gaaaatggat 21300 
gggaaaaaga tgctacagaa ttttcagata aaaatgaaat aagagttgga aataattfctg 21360 
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ccatggaaat caatctaaat gccaacctgt ggagaaattt cctgtactcc aacatagcgc 21420 
tgtatttgcc cgacaagcta aagtacagtc cttccaacgt aaaaatttct gataacccaa 21480 
acacctacga ctacatgaac aagcgagtgg tggctcccgg gttagtggac tgctacatta 21540 
accttggagc acgctggtcc cttgactata tggacaacgt caacccattt aaccaccacc 21600 
gcaatgctgg cctgcgctac cgctcaatgt tgctgggcaa tggtcgctat gtgcccttcc 21660 
acatccaggt gcctcagaag ttctttgcca ttaaaaacct ccttctcctg ccgggctcat 21720 
acacctacga gtggaacttc aggaaggatg ttaacatggt tctgcagagc tccctaggaa 21780 
atgacctaag ggttgacgga gccagcatta agtttgatag catttgcctt tacgccacct 21840 
tcttccccat ggcccacaac accgcctcca cgcttgaggc catgcttaga aacgacacca 21900 
acgaccagtc ctttaacgac tatctctccg ccgccaacat gctctaccct atacccgcca 21960 
acgctaccaa cgtgcccata tccatcccct cccgcaactg ggcggctttc cgcggctggg 22020 
ccttcacgcg ccttaagact aaggaaaccc catcactggg -ctcgggctac gacccttatt 22080 
acacctactc tggctctata ccctacctag atggaacctt ttacctcaac cacaccttta 22140 
agaaggtggc cattaccttt gactcttctg tcagctggcc tggcaatgac cgcctgefcta 22200 
cccccaacga gtttgaaatt aagcgctcag ttgacgggga gggttacaac gttgcccagt 22260 
gtaacatgac caaagactgg ttcctggtac aaatgctagc taactacaac attggctacc 22320 
agggcttcta tatcccagag agctacaagg accgcatgta ctccttcttt agaaacttcc 22380 
agcccatgag ccgtcaggtg gtggatgata ctaaatacaa ggactaccaa caggtgggca 22440 
tcctacacca acacaacaac tctggatttg ttggctacct tgcccccacc atgcgcgaag 22S00 
gacaggccta ccctgctaac ttcccctatc. cgcttatagg caagaccgca gttgacagca 22560 
ttacccagaa- aaagtttctt tgcgatcgca ccctttggcg catcccattc tocagtaact 22620 
ttatgtccat gggcgcactc acagacctgg gccaaaacct tctctacgcc aactccgccc 22*680 
acgcgctaga catgactttt gaggtggatc ccatggacga gcocaccctt ctttatgttt 22740 
tgtttgaagt ctttgacgtg gtccgtgtgc accggccgca ccgcggcgtc atcgaaaccg 22800 
tgtacctgcg cacgcccttc tcggccggca acgccacaac ataaagaagc aagcaacatc 22860 
aacaacagct gccgccatgg gctccagtga gcaggaactg aaagccattg fccaaagatct 22920 
tggttgtggg ccatattttt tgggcaccta tgacaagcgc tttccaggct ttgtttctcc 22980 
acacaagctc gcctgcgcca tagtcaatac ggccggtcgc gagactgggg gcgtacactg 23040 
gatggccttt gcctggaacc cgcactcaaa aacatgctac ctctttgagc cctttggctt 23100 
ttctgaccag cgactcaagc aggtttacca gtttgagtac gagtcactcc tgcgccgtag 23160 
cgccattgct tcttcccccg accgctgtat aacgctggaa aagtccaccc aaagcgtaca 23220 
ggggcccaac tcggccgcct gtggactatt ctgctgcatg tttctocacg cctttgccaa 23280 
ctggccccaa actcccatgg atcacaaccc caccatgaac cttattaccg gggtacccaa 23340 
ctccatgctc aacagtcccc aggtacagcc caccctgcgt cgcaaccagg aacagctcta 23400 
cagcttcctg gagcgccact cgccctactt ccgcagccac agtgcgcaga ttaggagcgc 23460 
cacttctttt tgtcacttga aaaacatgta aaaataatgt actagagaca cttt<:aataa 23520 
aggcaaatgc ttttatttgt acactctcgg gtgattattt acccccaccc ttgccgtctg 23580 
cgccgtttaa aaatcaaagg ggttctgccg cgcatcgcta tgcgccactg gcagggacac 23640 
gttgcgatac tggtgtttag tgctccactt aaactcaggc acaaccatcc gcggcagctc 23700 
ggtgaagttt tcactccaca ggctgcgcac catcaccaac gcgtttagca ggtcgggcgc 23760 
cgatatcttg aagtcgcagt tggggcctcc gccctgcgcg cgcgagttgc gatacacagg 23820 
gttgcagcac tggaacacta tcagcgccgg gtggtgcacg ctggccagca cgctcttgtc -23880 
ggagatcaga tccgcgtcca ggtcctccgc gttgctcagg gcgaacggag tcaactttgg 23940 
tagctgcctt cccaaaaagg gcgcgtgccc aggctttgag ttgcacfccgc accgtagtgg 24000 
catcaaaagg tgaccgtgcc cggtctgggc gttaggatac agcgcctgca taaaagcctt 24060 
gatctgctta aaagccacct gagcctttgc gccttcagag aagaacatgc cgcaagactt 24120 
gccggaaaac tgattggccg gacaggccgc gtcgtgcacg -cagcaccttg cgtcggtgtt 24180 
ggagatctgc accacatttc ggccccaccg gttctfccacg atcfctggcct tgctagactg 24240 
ctccttcagc gcgcgctgcc cgttttcgct cgtcacatcc atttcaatca cgtgctcctt 24300 
atttatcata atgcttccgt gtagacactt aagctcgcct tcgatctcag ^cgcagcggtg 24360 
cagccacaac gcgcagcccg tgggctcgtg atgcttgtag gtrcacctctg caaacgactg 54420 
caggtacgcc tgcaggaatc gccccatcat <:gtcacaaag gtcttgttgc tggtgaaggt 24480 
cagctgcaac ccgcggtgct cctcgttcag ccaggtcttg -catacggccg ccagagcttc 24540 
cacttggtca ggcagtagtt tgaagttcgc ctttagatcg ttatccacgt ggtacctgtc 24600 
catcagcgcg cgcgcagcct ccatgccctt ctcccacgca gacacgatcg gcacactcag 24660 
cgggttcatc accgtaattt cactttccgc ttcgctgggc tcttcctct* -cctcttgcgt 24720 
ccgcatacca cgcgccactg ggtcgtcttc attcagccgc cgcactgtgc gcttacctcc 24780 
tttgccatgc ttgattagca ccggtgggtt gctgaaaccc accatfctgta gcgccacatc 24€40 
ttctctttct. tcctcgctgt ccacgattac ct-ctggtgat ggcgggcgct -cgggcttggg 24900 
agaagggcgc ttctttttct tcttgggcgc aatggccaaa tccgccgccg aggtogatgg 24960 
ccgcgggctg ggtgtgcgcg gcaccagcgc gtcttgtgat gagtcttcct -cgtcctcgga -25020 
ctcgatacgc cgcctcatcc gcttttttgg gggcgcccgg ggaggcggcg gcgacgggga 25080 
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cggggacgac acgtcctcca tggttggggg acgtcgcgcc gcaccgcgtc cgcgctcggg 25140 
ggtggtttcg cgctgctcct cttcccgact ggccatttcc ttctcctata ggcagaaaaa 2S200 
gatcatggag tcagtcgaga agaaggacag cctaaccgcc ccctctgagt fccgccaccac 25260 
cgcctccacc gatgccgcca acgcgcctac caccttcccc gtcgaggcac ccccgcttga -25320 
ggaggaggaa gtgattatcg agcaggaccc aggttttgta agcgaagacg acgaggaccg 25380 
ctcagtacca acagaggata aaaagcaaga ccaggacaac gcagaggcaa acgaggaaca 25440 
agtcgggcgg ggggacgaaa ggcatggcga ctacctagat gtgggagacg acgtgctgtt 25S00 
gaagcatctg cagcgccagt gcgccattat ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
gcccctcgcc atagcggatg tcagccttgc ctacgaacgc cacctattct caccgcgcgt -25620 
accccccaaa cgccaagaaa acggcacatg cgagcccaac ccgcgcctea acttctaccc 25680 
cgtatttgcc gtgccagagg tgcttgccac ctatcacatc tttttccaaa actgcaagat 25740 
acccctatcc tgccgtgcca accgcagccg agcggacaag cagctggcct tgcggcaggg 2S800 
cgctgtcata cctgatatcg cctcgctcaa cgaagtgcca aaaatxrtttg agggtcttgg 25860 
acgcgacgag aagcgcgcgg caaacgctct gcaacaggaa aacagcgaaa atgaaagtca 2S920 
ctctggagtg ttggtggaac tcgagggtga caacgcgcgc ctagccgtac taaaacgcag 25980 
catcgaggtc acccactttg cctacccggc acttaaccta ccccccaagg tcatgagcac 26040 
agtcatgagt gagctgatcg tgcgccgtgc gcagcccctg gagagggatg caaatttgca 26100 
agaacaaaca gaggagggcc tacccgcagt tggcgacgag -cagctagcgc gctggcttca 26160 
aacgcgcgag cctgccgact tggaggagcg acgcaaacta atgatggccg cagtgcfccgt 26220 
taccgtggagr cttgagtgca tgcagcggtt ctttgctgac ocggagatgc agcgcaagct 26280 
agaggaaaca ttgcactaca cctttcgaca gggctacgta cgccaggcct gcaagatctc 26340 
caacgtggag ctctgcaacc tggtctccta ccttggaatt ttgcacgaaa accgccttgg 26400 
gcaaaacgtg cttcattcca cgctcaaggg cgaggcgcgc cgcgactacg tccgcgactg 26460 
cgtttactta tttctatgct acacctggca, gacggccatg ggcgtttggc agcagtgctt 26520 
ggaggagtgc aacctcaagg agctgcagaa actgctaaag caaaacttga aggacctatg 26580 
gacggccttc aacgagcgct ccgtggccgc gcacctggcg gacatcattt tccccgaacg 26640 
cctgcttaaa accctgcaac agggtctgcc agacttcacc agtcaaagca tgttgcagaa 26700 
ctttaggaac tttatcctag agcgctcagg aatcttgccc gecacctgct gtgcacttcc 26760 
tagcgacttt gtgcccatta agtaccgcga atgccctccg -ccgctttggg gccactgcta 26820 
ccttctgcag ctagccaact accttgccta ccactctgac ataatggaag acgtgagcgg 26880 
tgacggtcta ctggagtgtc actgtcgctg caacctatgc accccgcacc gctcectggt 26940 
ttgcaattcg cagctgctta acgaaagtca aattatcggt acctttgagc tgcagggtcc 27000 
ctcgcctgac gaaaagtccg cggctccggg gttgaaactc actccggggc tgtggacgtc 27060 
ggcttacctt cgcaaatttg tacctgagga ctaccacgcc cacgagatta ggttctaoga 27120 
agaccaatcc cgcccgccaa atgcggagct taccgcctgc gtcattaccc agggccacat 27180 
tcttggccaa ttgcaagcca tcaacaaagc ccgccaagag tttctgctac gaaagggacg 27240 
gggggtttac ttggaccccc agtccggcga ggagctcaac ccaatccccc -cgccgccgca 27300 
gccctatcag cagcagccgc gggcccttgc ttcccaggat ggcacccaaa aagaagctgc 27360 
agctgccgcc gccacccacg gacgaggagg aatactggga cagfrcaggca gaggaggttt 27420 
tggacgagga ggaggaggac atgatggaag actgggagag cctagacgag gaagcttccg 27480 
aggtcgaaga ggtgtcagac gaaacaccgt caccctcggt cgcattcccc *cgccggcgc 27S40 
cccagaaatc ggcaaccggt tccagcatgg ctacaacctc cgctcctcag gcgecgccgg 27-600 
cactgcccgt tcgccgaccc aaccgtagat gggacaccac tggaaccagg gccggtaagt 27660 
ccaagcagcc gccgccgtta gcccaagagc aacaacagcg ccaaggctac -cgctcatggc 27750 
gcgggcacaa gaacgccata gttgcttgct tgcaagactg tgggggcaac atctcctfccg 27780 
cccgccgctt tcttctctac catcacggcg tggccttccc ccgtaacatc otgcattact 27840 
accgtcatct ctacagccca tactgcaccg gcggcagcgg cagcggcagc aacagcagcg 27900 
gccacacaga agcaaaggcg accggatagc aagactctga caaagcccaa gaaatccaca 27960 
gcggcggcag cagcaggagg aggagcgctg cgtctggcgc ccaacgaacc <cgtafccgacc 28020 
cgcgagctta gaaacaggat ttttcccact ctgtatgcta tatttzcaaca gagcaggggc 28080 
caagaacaag agctgaaaat aaaaaacagg tctctgcgat <:cctcacccg <cagctgcctg 28140 
tatcacaaaa gcgaagatca gcttcggcgc acgctggaag acgcggaggc tctcttcagt 28200 
aaatactgcg cgctgactct taaggactag tttcgcgccc tttctcaaat ttaagcgcga 28260 

asarl-arafr afrfrranm nrrararrrn nnnrranrar ot-<-rt- r*rr1-r*=»n rrinnar + al-ct 9R^90 
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ttcacgcctc gtcaggcaat cctaactctg cagacctcgt cctctgagcc gcgetctgga 28860 
ggcattggaa ctctgcaatt tattgaggag tttgtgccat cggtctactt taaccccttc 28920 
tcgggacctc ccggccacta tccggatcaa tttattccta actttgacgc ggtaaaggac 28980 
tcggcggacg gctacgactg aatgttaagt ggagaggcag agcaactgcg cctgaaacac 29040 
ctggtccact gtcgccgcca caagtgcttt gcccgcgact ccggtgagtt ttgctacttt 29100 
gaattgcccg aggatcatat cgagggcccg gcgcacggcg tccggcttac cgcccaggga 29160 
gagcttgccc gtagcctgat tcgggagttt acccagcgcc ccctgctagt tgagcgggac 29220 
aggggaccct gtgttctcac tgtgatttgc aactgtccta accttggatt acatcaagat 29280 
ttaattaatt gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaatcgt 29340 
ttgtgttatg tttcaacgtg tttatttttc aattgcagaa aatttcaagt cattfcttcat 29400 
tcagtagtat agccccacca ccacatagct tatacagatc accgtacctt aatcaaactc 29460 
acagaaccct agtattcaac ctgccacctc cctcccaaca cacagagtac acagtccttt 29520 
ctccccggct ggccttaaaa agcatcatat catgggtaac agacatattc ttaggtgtta 29S80 
tattccacac ggtttcctgt cgagccaaac gctcatcagt gatattaata aactccccgg 29640 
gcagctcact taagttcatg tcgctgtcca gctgctgagc -cacaggctgc . tgtccaactt 29700 
gcggttgctt aacgggcggc gaaggagaag tccacgccta catgggggta gagtcataat 29760 
cgtgcatcag gatagggcgg tggtgctgca gcagcgcgcg aataaactgc tgccgccgcc 29820 
gctccgtcct gcaggaatac aacatggcag tggtctoctc agcgatgatt -cgcaccgccc 29880 
gcagcataag gcgccttgtc ctccgggcac agcagcgcac cctgatctca cttaaatcag 29940 
cacagtaact gcagcacagc accacaatat tgttcaaaat cccacagtgc aaggcgctgt 30000 
atccaaagct catggcgggg accacagaac ccacgtggcc atcataccac aagcgcaggt 30060 
agattaagtg gcgacccctc ataaacacgc tggacataaa cattacct<:t tetggcatgt 3012O 
tgtaattcac cacctcccgg taccatataa acctctgatt aaacatggcg ocatccacca 30180 
ccatcctaaa ccagctggcc aaaacctgcc cgccggctat acactgcagg gaaccgggac 30240 
tggaacaatg acagtggaga gcccaggact cgtaaccatg gatcatoatg ctcgfccatga 30300 
tatcaatgtt ggcacaacac aggcacacgt gcatacactt cctcaggatt acaagctcct 30360 
cccgcgttag aaccatatcc cagggaacaa cccattcctg aatcagcgta aatcccacac 30420 
tgcagggaag acctcgcacg taactcacgt tgtgcattgt -caaagtgtta -cattcgggca 30480 
gcagcggatg atcctccagt atggtagcgc gggtttctgt ctcaaaagga ggtagacgat 30S40 
ccctactgta cggagtgcgc cgagacaacc gagatcgtgt fcggtcgtagt gtcatgccaa 30600 
atggaacgcc ggacgtagtc atatttcctg aagcaaaacc aggtgcgggc gtgacaaaca 30660 
gatctgcgtc tccggtctcg ccgcttagat cgctctgtgt agtagttgta gtatatccac 30720 
tctctcaaag catccaggcg ccccctggct tcgggttcta tgtaaactcc ttcatgcgcc 30780 
gctgccctga taacatccac caccgcagaa taagccacac ccagccaacc tacacattcg 30840 
ttctgcgagt cacacacggg aggagcggga agagctggaa gaaccatgtt ttttttttta 30900 
ttccaaaaga ttatccaaaa cctcaaaatg aagatctatt aagtgaacgc gctcccctcc 30960 
ggtggcgtgg tcaaactcta cagccaaaga acagataatg gcatttgtaa gatgttgcac 31020 
aatggcttcc aaaaggcaaa cggccctcac gtccaagtgg acgtaaaggc fcaaacccttc 3 10 80 
agggtgaatc tcctctataa acattccagc accttcaacc atgcccaaat aattctcatc 31140 
tcgccacctt ctcaatatat ctctaagcaa atcccgaata ttaagfcccgg .ccattgtaaa 31200 
aatctgctcc agagcgccct ccaccttcag cctcaagcag cgaafccatga ttgcaaaaat 31260 
tcaggttcct cacagacctg tataagattc aaaagcggaa cattaacaaa aataccgcga 31320 
tcccgtaggt cccttcgcag ggccagctga acataatcgt gcaggtctgc acggaccagc 31380 
gcggccactt ccccgccagg aaccttgaca aaagaaccca cactgattat gacacgcata 31440 
ctcggagcta tgctaaccag cgtagccccg atgtaagctt tgttgcatgg gcggcgatat 31S00 
aaaatgcaag gtgctgctca aaaaatcagg caaagcctcg cgcaaaaaag aaagcacatc 31560 
gtagtcatgc tcatgcagat aaaggcaggt aagctccgga accaccacag aaaaagacac 31620 
catttttctc tcaaacatgt ctgcgggttt <:tgcataaac acaaaataaa ataacaaaaa 31680 
aacatttaaa cattagaagc ctgtcttaca acaggaaaaa caacccttat aagcataaga 31740 
cggactacgg ccatgccggc gtgaccgtaa aaaaactggt -caccgtgatt aaaaagcacc 31800 
accgacagct cctcggtcat gtccggagtc ataatgtaag actcggtaaa .cacatcaggt 31860 
tgattcatcg gtcagtgcta aaaagcgacc gaaatagccc gggggaatac atacccgcag 31920 
gcgtagagac aacattacag cccccatagg aggtataaca aaattaatag gagagaaaaa 31980 
cacataaaca cctgaaaaac cctcctgcct aggcaaaata gcaccctccc gctccagaac 32040 
aacatacagc gcttcacagc ggcagcctaa cagtcagcct taccagtaaa aaagaaaacc 32100 
tattaaaaaa acaccactcg acacggcacc agctcaatca gtcacagtgt aaaaaagggc 32160 
caagtgcaga gcgagtatat ataggactaa aaaatgacgt aacggttaaa gtccacaaaa 32220 
aacacccaga aaaccgcacg cgaacctacg cccagaaacg aaagccaaaa aacccacaac 32280 
ttcctcaaat cgtcacttcc gttttcccac gttacgtaac *tcccatttt aagaaaacta 32340 
caattcccaa cacatacaag ttactccgcc ctaaaaccta cgtcacccgc cccgttccca 32400 
cgccccgcgc cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat 32460 
aaggtatatt attgatgatg " 32480 
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<210> 24 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

ctcaacaatt gtggatccgt actcc 25 

<210> 25 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

gtgctcagca gatcttgcga ctgtg 25 

<210> 2(6 . 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 26 

ggcgcgttcg gatccactct cttcc 25 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 27 

ctacatgcta ggcagatctc gttcggag 28 

<210> 28 
<211> 1240 
<212> DNA 
<213> adenovirus 

<400> 28 

ggatccactc tcttccgcat cgctgtctgc gagggccagc tgttggggtg agtactccct €0 
ctgaaaagcg ggcatgactt ctgcgctaag attgtcagtt tccaaaaacg aggaggattt 120 
gatattcacc tggcccgcgg tgatgccttt gagggtggcc gcatccatct ggtcagaaaa 180 
gacaatcttt ttgttgtcaa gcttggtggc aaacgacccg tagagggcgt -tggacagcaa -240 
cttggcgatg gagcgcaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat 300 
gtttagctgc acgtattcgc gcgcaacgca ccgccattcg ggaaagacgg tggtgcgctc 360 
gtcgggcacc aggtgcacgc gccaaccgcg gttgtgcagg gtgacaaggt .caaxxfctggt 420 
ggctacctct ccgcgtaggc gctcgttggt ccagcagagg cggccgcect fcgcgcgagca 480 
gaatggcggt agggggtcta gctgcgtctc gtccgggggg tctgcgtcca cggtaaagac 540 
cccgggcagc aggcgcgcgt cgaagtagtc tatcttgcat ccttgcaagt ctagcgectg 600 
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ctgccatgcg cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac occatggcat €60 
999gtgggtg agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct 720 
gagtattcca agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc 780 
gtatagttcg tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgct-c 840 
tgctcggaag actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg 900 
gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 
gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 1020 
ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 1080 
gacaaactct- tcgcggtctt tccagtactc ttggatcgga aacccgfccgg cctocgaacg 1140 
agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 1200 
tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct " ~ 1240 

<210> 29 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV60 
<400> 29 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 - 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttec 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaacoca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ccactctctt ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta 960 
ctccctctga aaagcgggca tgacttctgc gctaagattg tcagtttcca aaaacgagga 1020 
ggatttgata ttcacctggc ccgcggtgat gcctttgagg gtggccgcat ccatetggtc 1080 
agaaaagaca atctttttgt tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga 1140 
cagcaacttg gcgatggagc gcagggtttg gtttttgtcg -cgatcggcgc gctcctttggc 1200 
cgcgatgttt agctgcacgt attcgcgcgc aacgcaccgc cattcgggaa agacggtggt 1260 
gcgctcgtcg ggcaccaggt gcacgcgcca accgcggttg tgcagggtga caaggtcaac 1320 
gctggtggct acctctccgc gtaggcgctc gttggtccag cagaggcggc -cgcccttgcg 1380 
cgagcagaat ggcggtaggg ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt 1440 
aaagaccccg ggcagcaggc gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag 1S00 
cgcctgctgc catgcgcggg cggcaagcgc gcgctcgtat gggttgagtg ggggacccca 1560 
tggcatgggg tgggtgagcg cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg 1620 
ctctctgagt attccaagat atgtagggta gcatcttcca ccgcggatgc tggcgcgcac 1680 
gtaatcgtat agttcgtgcg agggagcgag gaggtcggga ccgaggttgc tacgggcggg 1740 
ctgctctgct cggaagacta tctgcctgaa gatggcatgt gagttggatg atatggttgg 1800 
acgctggaag acgttgaagc tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc 1860 
gtaggagtcg cgcagcttgt tgaccagctc ggcggtgacc -tgcacgtcta gggcgcagta 1920 
gtccagggtt tccttgatga tgtcatactt atcctgtccc t^ttttttcc acagcfccgcg 1980 
gttgaggaca aactcttcgc ggtctttcca gtactcttgg atcggaaacc cgtx:ggcctc 2040 
cgaacgagat ccgtactccg ccgccgaggg acctgagcga gtccgcatcg accggatcgg 2100 
aaaacctctc gagaaaggcg tctaaccagt cacagtcgca agatccaaga tgaagcgcgc 2160 
aagaccgtct gaagatacct tcaaccccgt gtatocatat gacacggaaa ccggtcctcc 2220 
aactgtgcct tttcttactc ctccctttgt atcccccaat gggtttcaag agagfeccccc 2280 
tggggtactc tctttgcgcc tatccgaacc tctagttat:c tccaatggca tgcttgcgct 2340 
caaaatgggc aacggcctct ctctggacga ggccggcaac cttaectcec aaaatgtaac 2400 
cactgtgagc ccacctctca aaaaaaccaa gtcaaaca-ta aacctggaaa fcatctgcacc 24*60 
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cctcacagtt acctcagaag ccctaactgt ggctgccgcc gcacctctaa tggtcgcggg 2520 
caacacactc accatgcaat cacaggcccc gctaaccgtg cacgactcca aacttagcat 2580 
tgccacccaa ggacccctca cagtgtcaga aggaaagcta gccctgcaaa catcaggccc 2640 
cctcaccacc accgatagca gtacccttac tatcactgcc tcaccccctc taactactgc 2700 
cactggtagc ttgggcattg acttgaaaga gcccatttat acacaaaatg gaaaactagg 2760 
actaaagtac ggggctcctt tgcatgtaac agacgaccta aacactttga ccgtagcaac 2820 
tggtccaggt gtgactatta ataatacttc cttgcaaact aaagttactg gagccttggg 2880 
ttttgattca caaggcaata tgcaacttaa tgtagcagga ggactaagga ttgattctca 2940 
aaacagacgc cttatacttg atgttagtta tccgtttgat gctcaaaacc aactraaatrct 3000 
aagactagga cagggccctc tttttataaa ctcagcccac aacttggata ttaactacaa 3060 
caaaggcctt tacttgttta cagcttcaaa caattccaaa aagcttgagg ttaacctaag 3120 
cactgccaag gggttgatgt ttgacgctac agccatagcc attaatgcag gagatgggct 3180 
tgaatttggt tcacctaatg caccaaacac aaatcccctc aaaacaaaaa ttggccatgg 3240 
cctagaattt gattcaaaca aggctatggt tcctaaacta ggaactggcc ttagttttga 3300 
cagcacaggt gccattacag taggaaacaa aaataatgat aagctaactt ^gtggaccac 3360 
accagctcca tctcctaact gtagactaaa tgcagagaaa gatgctaaac tcactttggt 3420 
cttaacaaaa tgtggcagtc aaatacttgc tacagtttca gttttggctg fctaaaggcag 3480 
tttggctcca atatctggaa cagttcaaag tgctcatctt attataagat ttgacgaaaa 3S40 
tggagtgcta ctaaacaatt ccttcctgga <:ccagaatat tggaacttta gaaatggaga 3600 
tcttactgaa ggcacagcct atacaaacgc tgttggattt atgcctaacc tatcagctta 3660 
tccaaaatct cacggtaaaa ctgccaaaag taacattgtc agtcaagttt acttaaacgg 3720 
agacaaaact aaacctgtaa cactaaccat tacactaaac ggtacacagg aaacaggaga 3780 
cacaactcca agtgcatact ctatgtcatt ttcatgggac tggtctggcc acaactacat 3840 
taatgaaata t.ttgccacat cctcttacac tttttcatac attgcccaag: aataaaagaa 3900 
gcggccgctc gagcatgcat ctagagggcc ctattctata gtgtcaccta aatgctagag 3960 
ctcgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 4020 
ccgtgccttc cttgaccctg gaaggtgcca -ctcccactgt ccttfccctaa taaaatgagg 4080 
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 4140 
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 4200 
tggcttctga ggcggaaaga accagctggg gctctagggg gtatccccac gcgccctgta 4260 
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4320 
gcgccctagc gcccgctcct ttcgctttct tcccttcctt -tctcgccacg ttcgccggct 4380 
ttccccgtca agctctaaat cggggcatcc ctttagggtt ccgatttagt gctttacggc 4440 
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca fccgccctgat 4500 
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4560 
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa <jggattttgg 4^620 
ggatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatetaac gcgaattaat 4680 
tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag gcaggcagaa 4740 
gtatgcaaag catgcatctc aattagtcag caaccaggtg tggaaagtcc ocaggctccc 4800 
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaocata gtcccgeccc 4860 
taactccgcc catcccgccc ctaactccgc ccagttccgc ecattctccg ccccatggct 4920 
gactaatttt ttttatttat gcagaggccg aggccgcctc tgcctctgag ctatfeccaga 4980 
agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagctoccg ggagcttgta S040 
tatccatttt cggatctgat caagagacag gatgaggatc gtttcgcatg attgaacaag 5100 
atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc 1:atgactggg 5160 
cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc S220 
cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag gacgaggcag 5280 
cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgt<jctc gacgttgfeca 5340 
ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat otcctgtcat 5400 
ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg oggctgcata S4€0 
cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acafccgcatc gagcgagcac 5520 
gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc 5580 
tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc gaggafcctcg 5640 
tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc cgcttttctg 5700 
gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta 5760 
cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcetcctc gtgctttacg 5820 
gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg cctfccttgac gagttcttct 5880 
gagcgggact ctggggttcg aaatgaccga ccaagcgacg cccaacctgc catcacgaga 5940 
tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc <6000 
cggctggatg atcctccagc gcggggatct catgctggag ttcttcgocc accccaactt ^6060 
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaat* -tcacaaataa €120 
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg ^tafccttatca «180 
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tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 6240 
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 6300 
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 6360 
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 6420 
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg -ctcactgact cgctgcgctc -6480 
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatocac 6S40 
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 6600 
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 6660 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6720 
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6780 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta €840 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac -cccccgttca 6900 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga €960 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 7020 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7080 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 7140 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 7200 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg cfccagtggaa 7260 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7320 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 7380 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 7440 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 7500 
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 7560 
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgccfcc 7620 
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgcdag ttaatagttt 7680 
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 7740 
ttcattcagc tccggttccc aacgafccaag gcgagttaca tgatccccca tgttgtgcaa 7800 
aaaagcggtt agctccttcg gtcctccgat cgttgt-caga agtaagttgg ccgcagtgtt 7860 
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat -ccgtaagatg 7920 
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 7980 
gagttgctct tgcccggcgt caatacggga taataccgcg -ccacatagca gaactttaaa 8040 
agtgctcatc attggaaaac gttctfccggg gcgaaaactc tcaaggatct taccgctgtt 8100 
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 8160 
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 6220 
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 8280 
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 8340 
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 8383 

<210> 30 
<211> 7960 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV67 
<400> 30 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc <tgct<ztgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca> acaaggcaag gcttgaocga ^raattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc -cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgaoc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg -taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc <:tacttggca gtacatctac <jtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg €60 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg tt-ttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc -cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gotggctagc 900 
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gtttaaactt aagcttggta ccgagctcgg 
gggccagctg ttggggtgag tactccctct 
tgtcagtttc caaaaacgag gaggatttga 
gggtggccgc atccatctgg tcagaaaaga 
acgacccgta gagggcgttg gacagcaact 
cgcgatcggc gcgctccttg gccgcgatgt 
gccattcggg aaagacggtg gtgcgctcgt 
tgtgcagggt gacaaggtca acgctggtgg 
agcagaggcg gccgcccttg cgcgagcaga 
ccggggggtc tgcgtccacg gtaaagaccc 
tcttgcatcc ttgcaagtct agcgcctgct 
atgggttgag tgggggaccc catggcatgg 
aaatgtcgta aacgtagagg ggctctctga 
caccgcggat gctggcgcgc acgtaatcgt 
gaccgaggtt gctacgggcg ggctgctctg 
gtgagttgga tgatatggtt ggacgctgga 
ccgcgtcacg cacgaaggag gcgtaggagt 
cctgcacgtc tagggcgcag tagtccaggg 
cctttttttt ccacagctcg cggttgagga 
ggatcggaaa cccgtcggcc tccgaacgag 
gagtccgcat cgaccggatc ggaaaacctc 
caagatccaa gatgaagcgc gcaagaccgt 
atgacacgga aaccggtcct ccaactgtgc 
atgggtttca agagagtccc cctggggtac 
cctccaatgg catgcttgcg ctcaaaatgg 
accttacctc ccaaaatgta accactgtga 
taaacctgga aatatctgca cccctcacag 
ccgcacctct aatggtcgcg ggcaacacac 
tgcacgactc caaacttagc attgccaccc 
tagccctgca aacatcaggc cccctcacca 
cctcaccccc tctaactact gccactggta 
atacacaaaa tggaaaacta ggactaaagt 
taaacacttt gaccgtagca actggtccag 
ctaaagttac tggagccttgggttttgatt 
gaggactaag gattgattct caaaacagac 
atgctcaaaa ccaactaaat ctaagactag 
acaacttgga tattaactac aacaaaggcc 
aaaagcttga ggttaaccta agcactgcca 
ccattaatgc aggagatggg cttgaatttg 
tcaaaacaaa aattggccat ggcctagaat 
taggaactgg ccttagtttt gacagcacag 
ataagctaac tttgtggacc acaccagctc 
aagatgctaa actcactttg gtcttaacaa 
cagttttggc tgttaaaggc agtttggctc 
ttattataag atttgacgaa aatggagtgc 
attggaactt tagaaatgga gatcttactg 
ttatgcctaa cctatcagct tatccaaaat 
tcagtcaagt ttacttaaac ggagacaaaa 
acggtacaca ggaaacagga gacacaactc 
actggtctgg ccacaactac attaatgaaa 
acattgccca agaataaaag aagcggccgc 
gatcagcctc gactgtgcct tctagttgcc 
cttccttgac cctggaaggt gccactccca 
catcgcattg tctgagtagg tgtcattcta 
agggggagga ttgggaagac aatagcaggc 
ctgaggcgga aagaaccagc tggggctcta 
cattaagcgc ggcgggtgtg gtggttacgc 
tagcgcccgc tcctttcgct ttcttccctt 
gtcaagctct aaatcggggc atccctttag 
accccaaaaa acttgattag ggtgatggtt 
tttttcgccc tttgacgttg gagtccacgt 
gaacaacact caaccctatc tcggtctatt 



atccactctc ttccgcatcg ctgtetgcga 960 
gaaaagcggg catgacttct gcgctaagat 1020 
tattcacctg gcccgcggtg atgcctttga 1080 
caatcttttt gttgtcaagc ttggtggcaa 1140 
tggcgatgga gcgcagggtt tggtttttgt 1200 
ttagctgcac gtattcgcgc gcaacgcacc 1260 
cgggcaccag gtgcacgcgc caaccgcggt 1320 
ctacctctcc gcgtaggcgc tcgttggtcc 1380 
atggcggtag ggggtctagc *gcgtctcgt 1440 
cgggcagcag gcgcgcgtcg aagtagtcta 1500 
gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
ggtgggtgag cgcggaggcg tacatgccgc 1620 
gtattccaag atatgtaggg tagcatcttc 1680 
atagttcgtg cgagggagcg aggaggtcgg 1740 
ctcggaagac tatctgcctg aagatggcat 1800 
agacgttgaa gctggcgtct gtgagaccta 1860 
cgcgcagctt gttgaccagc tcggcggtga 1920 
tttccttgat gatgtcatac ttzatcctgtc 1980 
caaactcttc gcggtctttc -cagtactctt 2040 
atccgtactc cgccgccgag ggacctgagc 2100 
tcgagaaagg cgtctaacca gtcacagtcg 2160 
ctgaagatac cttcaacccc gtgtatccat 2220 
cttttcttac tcctcccttt gtat-ccecca 2280 
tctctttgcg cctatccgaa cctctagtta 2340 
gcaacggcct ctctctggac gaggccggca 2400 
gcccacctct caaaaaaacc aagtcaaaca 2460 
ttacctcaga agccctaact gtggctgccg 2520 
tcaccatgca atcacaggcc ccgctaaccg 2580 
aaggacccct cacagtgtca gaaggaaagc 2640 
ccaccgatag cagtaccctt actafccactg 2700 
gcttgggcat t:gacttgaaa gagcccattt 2760 
acggggctcc tttgcatgta acagacgaec 2820 
gtgtgactat taataatact tccttgcaaa 2880 
cacaaggcaa tatgcaactt aatgtagcag 2940 
gccttatact tgatgttagt tatccgttfcg 3000 
gacagggccc tctttttata aact<^gccc 3060 
tttacttgtt tacagcttca aacaattcca 3120 
aggggttgat gtttgacgct acagccatag 3 ISO 
gttcacctaa tgcaccaaac acaaatcecc 3240 
ttgatfccaaa caaggctatg gttcctaaac 3300 
gtgccattac agtaggaaac aaaaataatg 3360 
catctcctaa ctgtagacta aatgcagaga 3420 
aatgtggcag tcaaatactt gctacagttt 3480 
caatatctgg aacagttcaa agtgctcatc 3540 
tactaaacaa ttccttcctg gacecagaat 3600 
aaggcacagc otatacaaac gctgttggat 3660 
ctcacggtaa aactgccaaa agtaacattg 3720 
ctaaacctgt aacactaacc attacactaa 3780 
caagtgcata ctctatgtca ttttcatggg 3840 
tatttgccac atcctcttac actttt-tcat 3900 
tcgagtctag agggcccgtt taaacccgct 3960 
agccatctgt tgtttgcccc ^cccccgtgc 4020 
ctgtcctttc ctaataaaat gaggaaattg 4080 
ttctgggggg tggggtgggg caggacagca 4140 
atgctgggga tgcggtgggc tctatggctt 4200 
gggggtafccc.ccacgcgccc tgtagcggcg 4260 
gcagcgtgac cgctacactt gccagcgccc 4320 
cctttctcgc cacgttcgcc ggctttcccc 4380 
ggttccgatt tagtgct-tta -cggcacctcg 4440 
cacgtagtgg gccatcgccc tgatagacgg 4S00 
t-ctttaatag tggactcttg ttrccaaactg 4^560 
cttttgattt ataagggatt ttggggafctt 4*620 
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cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat taattctgtg 4680 
gaatgtgtgt cagttagggt gtggaaagtc cccaggctcc ccaggcaggc agaagtatgc 4740 
aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc tccccagcag 4800 
gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg cccctaactc 4860 
cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa 4920 
ttttttttat ttatgcagag gccgaggccg cctctgcctc tgagctattc cagaagtagt 4980 
gaggaggctt ttttggaggc ctaggctttt gcaaaaagct cccgggagct tgtatatcca 5040 
ttttcggatc tgatcagcac gtgttgacaa ttaatcatcg gcatagtata tcggcatagt 5100 
ataatacgac aaggtgagga actaaaccat ggccaagttg accagtgccg ttccggtgct 5160 
caccgcgcgc gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg 5220 
ggacttcgtg gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag 5280 
cgcggtccag gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct 5340 
ggacgagctg tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg 5400 
gccggccatg accgagatcg gcgagcagcc gtgggggcgg gagttcgocc tgcgcgaccc 5460 
ggccggcaac tgcgtgcact tcgtggccga ggagcaggac tgacacgtgc -tacgagattt 5520 
cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 5S80 
ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 5640 
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 5700 
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 5760 
ctgtataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc tgtttcctgt 5820 
gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 5880 
agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 5940 
tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 6000 
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 6060 
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 6120 
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 6180 
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 6240 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 6300 
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 6360 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct 6420 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgtfccagcc 6480 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 6540 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggt^c 6600 
tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 6660 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gafcccggcaa 6720 
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa €780 
aaaaggatct caagaagatc ctttgatctt ttctacgggg totgacgctc agtggaacga 6840 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 6900 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 6960 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 7020 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 7080 
ccccagtgct gcaatgatac cgcgagaccc acgct<:accg gctccagatt tat<cagcaat 7140 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 7200 
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagt-ttgcg 7260 
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 7320 
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 7380 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagfrtggccg <:agtgttato 7440 
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 7500 
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 7560 
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 7620 
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 7680 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 7740 
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 7800 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 7860 
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 7920 
ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 7960 

<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgggatcca agatgaagcg cgcaagaccg 30 

<210> 32 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

cactatagcg gccgcattct cagtcatctt 30 

<210> 33 
<2il> 7989 

<212> DNA . :. .:- . 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: plasmid pDV69 
<400> 33 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctcfcgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgtfcgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta -catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt S40 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacat-caa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa -tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg -caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct <*ggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg -catgacttct gcgctaagat 1020 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga raatcttttt gttgtcaagc ttggtggcaa 1140 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc fccgttggtcc 1380 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 1440 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1SO0 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg gg c 99^ aa 9C gcgcgctcgt XS€0 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggfccgg 1740 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc -cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg <:gtctaacca gt^cacagfccg 2160 
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caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa -cctctagtta 2340 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct <rtctctggac gaggccggca 2400 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 24*60 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgecg 2520 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 2640 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 2760 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2820 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2880 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 2940 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3000 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 3060 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 3120 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 3180 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 3240 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3300 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 3360 
ataagctaac tttgtggacc ggtccaaaac cagaagccaa ctgcataatt gaatacggga 3420 
aacaaaaccc agatagcaaa ctaactttaa tccttgtaaa aaatggagga attgttaatg 3480 
gatatgtaac gctaatggga gcctcagact acgttaacac cttatttaaa aacaaaaatg 3S40 
tctccattaa tgtagaacta tactttgatg ccactggtca tatattacca gactcafcctt 3600 
ctcttaaaac agatctagaa ctaaaataca agcaaaccgc tgactttagt gcaagaggtt 3660 
ttatgccaag tactacagcg tatccatttg tccttcctaa tgcgggaaca «cataatgaaa 3720 
attatatttt tggtcaatgc tactacaaag caagcgatgg tgccctt-t-tt ccgttggaag. 3780 
ttactgttat gcttaataaa cgcctgccag atagtcgcac atcctatgtt atgacttttt 3840 
tatggtcctt gaatgctggt ctagctccag aaactactca ggcaaccctc ataacetccc 3900 
catttacctt ttcctatatt agagaagatg actgattttt aagaagcggc cgctcgagtc 3960 
tagagggccc gtttaaaccc gctgatcagc ctcgactgtg ccttctagtt gccagccatc 4020 
tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgocactc ccactgtcct 4080 
ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg 4140 
gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg 4200 
ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc snccntagct ggggctctag 4260 
ggggtatccc cacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 4320 
cagcgtgacc gctacacttg ccagcgccct agcgcecgct ccttt-cgctt tctteccttc 4380 
ctttctcgcc acgttcgccg gctttccccg tcaagcfccta aatcggggca fcccctttagg 4440 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4500 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 4S60 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4620 
ttttgattta taagggattt tggggatttc ggcctattgg t-taaaaaatg agctgattta 4680 
acaaaaattt aacgcgaatt aattctgtgg aatgtgtgtc agttagggtg tggaaagtcc 4740 
ccaggctccc caggcaggca gaagtatgca aagcatgcat <:fccaattagt cagcaaccag 4800 
gtgtggaaag tccccaggct ccccagcagg cagaagtatg caaagcatgc atctcaat£a 4860 
gtcagcaacc atagtcccgc ccctaactcc gcccatcccg -cccctaactc ogcccagttc 4920 
cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc 4980 
ctctgcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc -taggcttttg S040 
caaaaagctc ccgggagctt gtatatccat tttcggatct gatcagcacg tgttgacaat S100 
taatcatcgg catagtatat cggcatagta taatacgaca aggtgaggaa ctaaaccatg 51*60 
gccaagttga ccagtgccgt tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag 5220 
ttctggaccg accggctcgg gttctcccgg gacttcgtgg aggacgactt <:gccggtgtg 5280 
gtccgggacg acgtgaccct gttcatcagc gcggtccagg accaggtggt gccggacaac 5340 
accctggcct gggtgtgggt gcgcggcctg gacgagctgt acgccgagtg gt-cggaggtc 5400 
gtgtccacga acttccggga cgcctccggg ccggccatga ccgagatcgg xrgagcagccg 5460 
t 99999cggg agttcgccct gcgcgacccg gccggcaact gcgtgcactt cgtggccgag 5S20 
gagcaggact gacacgtgct acgagatttc gattccaccg ccgccttcta £gaaagg<:tg 5580 
ggcttcggaa tcgttttccg ggacgccggc tggatgatcc fcccagcgcgg ggatctcatg S640 
ctggagttct tcgcccaccc caacttgttt attgcagctt ataatggfcta caaataaagc 5700 
aatagcatca caaatttcac aaataaagca tttttttcac -tgcattctag ttgtggtttg 5760 
tccaaactca tcaatgtatc ttatcatgtc tgtataccgt cgacctctag ctagagcttg 5820 
gcgtaatcat ggtcatagct gtttcctgtg tgaaat-tgtt atccgcfccac aattccacac 5880 
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aacatacgag 
acattaattg 
cattaatgaa 
tcctcgctca 
tcaaaggcgg 
gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcaat 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 
gtgtcacgct 
gttacatgat 
gtcagaagta 
cttactgtca 
ttctgagaat 
accgcgccac 
aaactctcaa 
aactgatctt 
caaaatgccg 
ctttttcaat 
gaatgtattt 
cctgacgtc 



ccggaagcat 
cgttgcgctc 
tcggccaacg 
ctgactcgct 
taatacggtt 
agcaaaaggc 
cccctgacga 
tataaagata 
tgccgcttac 
gctcacgctg 
acgaaccccc 
acccggtaag 
cgaggtatgt 
gaaggacagt 
gtagctcttg 
agcagattac 
ctgacgctca 
ggatcttcac 
atgagtaaac 
tctgtctatt 
gggagggctt 
ctccagattt 
caactttatc 
cgccagttaa 
cgtcgtttgg 
cccccatgtt 
agttggccgc 
tgccatccgt 
agtgtatgcg 
atagcagaac 
ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 



aaagtgtaaa 
actgcccgct 
cgcggggaga 
gcgctcggtc 
atccacagaa 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatctggc 
atcagcaata 
cgcctccatc 
tagtttgcgc 
tatggcttca 
gtgcaaaaaa 
agtgttatca 
aagatgcttt 
gcgaccgagt 
tttaaaagtg 
gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 



gcctggggtg 
ttccagtcgg 
ggcggtttgc 
gttcggctgc 
tcaggggata 
aaaaaggccg 
aatcgacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat 
atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 
aacgttgttg 
ttcagctccg 
gcggttagct 
ctcatggtta 
tctgtgactg 
tgctcttgcc 
ctcatcattg 
tccagttcga 
agcgtttctg 
acacggaaat 
ggttattgtc 
gttccgcgca 



cctaatgagt 
gaaacctgtc 
gtattgggcg 

ggcgagcggt 

acgcaggaaa 
cgttgctggc 
xaagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
-ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccccgt 
caatgatacc 
ccggaagggc 
attgttgccg 
ccattgctac 
gttcccaacg 
ccttcggtcc 
tggcagcact 
gtgagtactc 
cggcgfccaat 
gaaaacgttc 
tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
catttccccg 



gagctaactc 
gtgccagctg 
ctcttccgct 
atcagctcac 
gaacatgtga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
♦ctccaagctg 
taactafccgt 
tggtaacagg 
gcctaactac 
'taccttcgga 
tggttttttt 
tttgatcttt 
ggtcatgaga 
taaatcaatc 
tgaggcacct 
<:gtgtagata 
gcgagaecca 
cgagcgcaga 
ggaagctaga 
aggcatcgtg 
atcaaggcga 
tccgatcgtt 
gcataattct 
aaccaagtca 
acgggataat 
ttcggggcga 
tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatattt 
aaaagtgcca 



5940 
€000 
€060 
'6120 
€180 
€240 
6300 
€360 
6420 
€480 
6540 
€600 
6660 
€720 
€780 
€840 
6900 
€960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
7989 



<210> 34 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



plasmid t3R£5-£l^SV40-Hygro 



<400> 34 

tctagaagat 

ttctagctac 

gtacaggatg 

agatcgatct 

gcgcgttcgt 

aacttcaggg 

atgttatatg 

tatcaccatg 

ttgtctcctc 

ttgtaacgaa 

tcactttttt 

acaattgtta 

ggaaatattc 

tggttacaat 

ccctctgcta 

gttattgtgc 



ccgctgtaca 
tttattagat 
ttctagctac 
cc tggccgtt 
cctcactctc 
tgagtttggg 
gagggggcaa 
gaccctcatg 
ttattttctt 
tttttaaatt 
ttcaaggcaa 
taattaaatg 
ttattggtag 
gatatacact 
accatgttca 
tgtctcatca 



ggatgttcta 
ccgctgtaca 
tttattagat 

cggggtcaaa 

ttccgcatcg 
gacccttgat 
agttttcagg 
ataattttgt 
ttcattttct 
cacttttgtt 
tcagggtata 
ataaggtaga 
aaacaactac 
gtttgagatg 
tgccttcttc 
ttttggcaaa 



gctactttat 
ggatgttcta 
ccgtgtacag 
aaccaggttt 
ctgtctgcga 
tgttctttct 
gtgttgttta 
ttctttcact 
gtaacttttt 
tatttgtcag 
ttatattgta 
atatttctgc 
atcctggtca 
aggataaaat 
tttttcctac 
gaattagatc 



tagatccgct 
gctactttat 
gatgttctag 
ggctataaaa 
gggecaggat 
ttttcgctat 
gaatgggaag 
ttctactctg 
cgttaaactt 
attgtaagta 
cttcagcaca 
atataaattc 
tcatcctgcc 
actctgagtc 
agctcctggg 
taagcttctg 



gtacaggatg €0 
tagatccgct 120 
ctactttatt 180 
gggggtgggg 240 
cgatcctgag 300 
-tgtaaaattc 3€0 
atgtcccttg 420 
ttgacaacca 480 
*tagcttgcat S40 
ctttctctaa 600 
gttttagaga 6€0 
tggctggcgt 720 
/tttctcttta 780 
xraaaccgggc 840 
-caacgtgctg 900 
.cagctcgagg 960 



# 
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actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 1020 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 1080 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga -cggcccccga 1140 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 1200 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 1320 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag -caccccgggc acggttgcag 1440 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 1*500 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 1620 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtect 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 1920 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2040 
ataatgttta acttgcatgg cgtigttaaat ggggcggiggc ttaaagggta tataatgcgc 2100 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta -cctcttggtt ttggaggttt "2220 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gafcaaatgga gcgaagaaac ccatctgagc 2460 
ggggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 2520 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cocgagagcc 2640 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2700 
gcattttgac aattacagag gatgggcagg ggctaaaggg ggtaaagagg gagcgggggg 2760 
cttgtgaggc tacagaggag gctaggaatc tagcttttag cttaatgacc agacaccgtc 2820 
ctgagtgtat tacttttcaa cagatcaagg ataattgcgc taatgagctt gatctgctgg 2880 
cgcagaagta ttccatagag cagctgacca cttactggct gcagccaggg gatgattttg 2940 
aggaggctat tagggtatat gcaaaggtgg cacttaggcc agattgcaag tacaagatca 3000 
gcaaacttgt aaatatcagg aattgttgct acatttctgg gaacggggcc gaggtggaga 3060 
tagatacgga ggatagggtg gcctttagat gtagcatgat aaatatgtgg ccgggggtgc 3120 
ttggcatgga cggggtggtt attatgaatg taaggtttac tggccccaat tttagcggta 3180 
cggttttcct ggccaatacc aaccttatcc tacacggtgt aagcttctat ^gggtttaaca 3240 
atacctgtgt ggaagcctgg accgatgtaa gggttcgggg ctgtgccttt tactgctgct 3300 
ggaagggggt ggtgtgtcgc cccaaaagca gggcttcaat taagaaatgc *ctctttgaaa 3360 
ggtgtacctt gggtatcctg tctgagggta actccagggt gcgccacaat gtggcctccg 3420 
actgtggttg cttcatgcta gtgaaaagcg tggctgtgat taagcataac atggtatgtg 3480 
gcaactgcga ggacagggcc tctcagatgc tgacctgctc ggacggcaac «tgtcacctgc 3540 
tgaagaccat tcacgtagcc agccactctc gcaaggcctg gccagtgttt gagcataaca 3600 
tactgacccg ctgttccttg catttgggta acaggagggg ggtgttccta ccttaccaat 3^60 
gcaatttgag tcacactaag atattgcttg agcccgagag catgtccaag gtgaacctga 3720 
acggggtgtt tgacatgacc atgaagatct ggaaggtgct gaggtacgat gagacccgca 3780 
ccaggtgcag accctgcgag tgtggcggta aacatattag gaaccagcct gtgatgctgg 3840 
atgtgaccga ggagctgagg cccgatcact tggtgctggc ctgcacccgc gctgagtttg 3900 
gctctagcga tgaagataca gattgaggta ctgaaatgtg tgggcgtggc ttaagggtgg 3960 
gaaagaatat ataaggtggg ggtcttatgt agttttgtat ctgttttgca gcagccgccg 4020 
ccgccatgag caccaactcg tttgatggaa gcattgtgag ctcatatttg acaacgcgca 4080 
tgcccccatg ggccggggtg cgtcagaatg tgatgggctc cagcattgat ggtcgcoccg 4140 
tcctgcccgc aaactctact accttgacct acgagaccgt gtctggaacg ccgttggaga 4200 
ctgcagcctc cgccgccgct tcagccgctg cagccaccgc ccgcgggatt gtgactgaet 4260 
ttgctttcct gagcccgctt gcaagcagtg cagcttcccg ttcatccgcc .cgcgatgaca 4320 
agttgacggc tcttttggca caattggatt ctttgacccg ggaacttaat gtcgtttctc 4380 
agcagctgtt ggatctgcgc cagcaggttt ctgccctgaa ggcttcctcc cctcccaatg 4440 
cggtttaaaa cataaataaa aaaccagact ctgtttggat ttggatcaag xraagtgtctt 4500 
gctgtctcag ctgactgctt aagtcgcaag ccgaattgga fcccaattcgg atcgatctta 4SS0 
ttaaagcaga acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca 4620 
aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtfctgtc -caaactcatc 4*680 
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aatgtatctt atcatgtctg gtcgactcta gactcttccg cttcctcgct cactgacbcg 4740 
ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg 4800 
ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag 4860 
gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac 4920 
gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga 4980 
taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt 5040 
accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc 5100 
tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaecc 51-60 
cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta S220 
agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat 5280 
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca S340 
gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct 5400 
tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt 5460 
acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct 5520 
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 5580 
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa S640 
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta S700 
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc S760 
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 5820 
ttatcagcaa taaaccagcc agccggaagg .. gccgagcgca gaagtggtcc tgcaacttta 5880 
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 5940 
aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtogttt 6000 
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 6060 
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 6120 
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 6180 
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 6240 
cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc acatageaga 6300 
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggafcctta ^3*60 
ccgctgttga gatccagttc gatgtaaccc actcgtgcac -ccaactgatc ttcagcatct 6420 
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc -cgcaaaaaag -6480 
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 6540 
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 6600 
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc 6660 
attattatca tgacattaac ctataaaaat aggcgtatca cgaggcccct ttcgtctcgc 6720 
gcgtttcggt gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc 6780 
ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg 6840 
cgggtgtcgg ggctggctta actatgcggc atcagagcag attgtactga gagtgcacca 6900 
tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca ggaaattgta 6960 
agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac 7020 
caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg 7080 
agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa 7140 
gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc -ctaat<:aagt 7200 
tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag occccgattt 7260 
agagcttgac ggggaaagcc ggcgaacgtg . gcgagaaagg aagggaagaa agcgaaagga 7320 
gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac *cacacccgcc 7380 
gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg caactgttgg 7440 
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 7500 
gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 7560 
gccagtgaat tgtaatacga ctcactatag ggcgaattaa ttcgggg 7607 



<210> 35 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: plasmid MMTV-E2a-SV40-Neo 
<400> 35 

gaattccgca ttgcagagat attgtattta agtgcctagc fccgatacaat aaacgccatt 60 
tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactrcocga 120 
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gagtgtccta cacctagggg agaagcagcc aaggggttgt ttcccaccaa ggacgacccg 180 
tctgcgcaca aacggatgag cccatcagac aaagacatat tcattctctg ctgcaaactt 240 
ggcatagctc tgctttgcct ggggctattg ggggaagttg cggttcgtgc tcgcagggct 300 
ctcacccttg actcttttaa tagctcttct gtgcaagatt acaatctaaa caattcggag 360 
aactcgacct tcctcctgag gcaaggacca cagccaactt cctcttacaa gccgcatcga 420 
ttttgtcctt cagaaataga aataagaatg cttgctaaaa attatatttt taccaataag 480 
accaatccaa taggtagatt attagttact atgttaagaa atgaatcatt atcttttagt 540 
actattttta ctcaaattca gaagttagaa atgggaatag aaaatagaaa gagacgcfcca €00 
acctcaattg aagaacaggt gcaaggacta ttgaccacag gcctagaagt aaaaaaggga €60 
aaaaagagtg tttttgtcaa aataggagac aggtggtggc aaccagggac ttatagggga 720 
ccttacatct acagaccaac agatgccccc ttaccatata caggaagata tgacttaaat 780 
tgggataggt gggttacagt caatggctat aaagtgttat atagatccct cccttttcgt 840 
gaaagactcg ccagagctag acctccttgg tgtatgttgt ctcaagaaga aaaagacgac 900 
atgaaacaac aggtacatga ttatatttat ctaggaacag gaatgcactt ttggggaaag 960 
attttccata ccaaggaggg gacagtggct ggactaatag aacattattc tgcaaaaact 1020 
catggcatga gttattatga atagccttta ttggcccaac cttgcggttc ccagggctta 1080 
agtaagtttt tggttacaaa ctgttcttaa aacgaggatg tgagacaagt ggtttcctga 1140 
cttggtttgg tatcaaaggt tctgatctga gctctgagtg ttctattttc ctatgttctt 1200 
ttggaattta tccaaatctt atgtaaatgc ttatgtaaac caagatataa aagagtgctg 1260 
attttttgag taaacttgca acagtcctaa cattcacctc ttgtgtgttt gtgtctgttc 1320 
gccatcccgt ctccgctcgt cacttatcct tcactttcca gagggtcccc ccgcagaccc 1380 
cggcgaccct caggtcggcc gactgcggca gctggcgccc gaacagggac cctcggataa 1440 
gtgacccttg tctctatttc tactatttgg tgtttgtctt gtattgtctc tttcttgtct 1500 
ggctatcatc acaagagcgg aacggactca ccatagggac caagctagcg cttctcgtcg 1560 
cgtccaagac cctcaaagat ttttggcact tcgtt^agcg aggcgatatc aggtatgaca 1620 
gcgccctgcc gcaaggccag ctgcttgtcc gctcggctgc ggttggcacg gcaggatagg 1680 
ggtatcttgc agttttggaa aaagatgtga taggtggeaa gcacctctgg cacggeaaat 1740 
acggggtaga agttgaggcg cgggttgggc tcgcatgtgc cgttttcttg gcgtttgggg 1800 
ggtacgcgcg gtgagaatag gtggcgttcg taggcaaggc tgacatccgc tatggcgagg 1860 
ggcacatcgc tgcgctcttg caacgcgtcg -cagataatgg cgcactggcg -ctgcagatgc 1920 
ttcaacagca cgtcgtctcc cacatctagg tagtcgccat gcctttcgtc cccccgcccg 1980 
acttgttcct cgtttgcctc tgcgttgtcc tggtcttgct ttttatcctc tgttggtact 2040 
gagcggtcct cgtcgtcttc gcttacaaaa cctgggtcct gctcgataat cacttcctcc 2100 
tcctcaagcg ggggtgcctc gacggggaag gtggtaggcg <:gttggcggc atcggtggag 2160 
gcggtggtgg cgaactcaga gggggcggtt aggctgtcct tctfcctcgac £gactccatg 2220 
atctttttct gcctatagga gaaggaaatg gccagtcggg aagaggagca gcgcgaaacc 2280 
acccccgagc gcggacgcgg tgcggcgcga cgtcccccaa ccatggagga cgtgtcgtcc 2340 
ccgtccccgt cgccgccgcc tccccgggcg -cccccaaaaa agcggatgag gcggcgtatc 2400 
gagtccgagg acgaggaaga ctcatcacaa gacgcgctgg tgccgcgcac acccagcccg 2460 
cggccatcga cctcggcggc ggatttggcc attgcgccca agaagaaaaa gaagcgcect 2520 
tctcccaagc ccgagcgccc gccatcacca gaggtaatcg tggacagcga ggaagaaaga 2580 
gaagatgtgg cgctacaaat ggtgggtttc agcaacccac cggtgctaat caagcatggc 2640 
aaaggaggta agcgcacagt gcggcggctg aatgaagacg acccagtggc gcgtggtatg 2700 
cggacgcaag aggaagagga agagcccagc gaagcggaaa gtgaaattac ggtgatgaac 27€0 
ccgctgagtg tgccgatcgt gtctgcgtgg gagaagggca tggaggctgc gcgcgcgctg 2820 
atggacaagt accacgtgga taacgatcta aaggcgaact tcaaactact gcctgaecaa 2880 
gtggaagctc tggcggccgt atgcaagacc tggctgaacg aggagcaccg cgggttgcag 2940 
ctgaccttca ccagcaacaa gacctttgtg acgatgatgg ggcgattcct gcaggcgtac 3000 
ctgcagtcgt ttgcagaggt gacctacaag catcacgagc -ccacgggctg cgcgttgtgg 30€0 
ctgcaccgct gcgctgagat cgaaggcgag cttaagtgtc tacacggaag cattatgata 3120 
aataaggagc acgtgattga aatggatgtg acgagcgaaa acgggcagcg <egcgctgaag 3180 
gagcagtcta gcaaggccaa gatcgtgaag aaccggtggg gccgaaatgt ggtgcagatc 3240 
tccaacaccg acgcaaggtg ctgcgtgcac gacgcggcct gtccggccaa tcagttttcc 3300 
ggcaagtctt gcggcatgtt cttctctgaa ggcgcaaagg ctcaggtggc ttttaagcag 33€0 
atcaaggctt ttatgcaggc gctgtatcct aacgcccaga ccgggcacgg tcacctttfcg 3420 
atgccactac ggtgcgagtg caactcaaag cctgggcacg cgcccttttt gggaaggcag 3480 
ctaccaaagt tgactccgtt cgccctgagc aacgcggagg acctggacgc ggatctgatc 3540 
tccgacaaga gcgtgctggc cagcgtgcac cacccggcgc tgatagtgtt ccagtgctgc 3600 
aaccctgtgt atcgcaactc gcgcgcgcag ggcggaggcc ccaactgcga cttcaagata 3660 
tcggcgcccg acctgctaaa cgcgttggtg atggtgcgca gcctgtggag tgaaaacttc 3720 
accgagctgc cgcggatggt tgtgcctgag tttaagtgga gcactaaaca -ccagtatcgc 3780 
aacgtgtccc tgccagtggc gcatagcgat gcgcggcaga accccttetga *t£tttaaacg 3840 
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gcgcagacgg caagggtggg ggtaaataat cacccgagag tgtacaaata aaagcatttg 3900 
cctttattga aagtgtctct agtacattat ttttacatgt ttttcaagtg acaaaaagaa 3950 
gtggcgctcc taatctgcgc actgtggctg cggaagtagg gcgagtggcg t:tocaggaag 4020 
ctgtagagct gttcctggtt gcgacgcagg gtgggctgta cctggggact gttgagcatg 4060 
gagttgggta ccccggtaat aaggttcatg gtggggttgt gatccatggg agtttggggc 4140 
cagttggcaa aggcgtggag aaacatgcag cagaatagtc cacaggcggc -cgagttgggc 4200 
ccctgtacgc tttgggtgga cttttccagc gttatacagc ggtcggggga agaagcaatg 4260 
gcgctacggc gcaggagtga ctcgtactca aactggtaaa cctgcttgag tcgctggtca 4320 
gaaaagccaa agggctcaaa gaggtagcat gtttttgagt gcgggttcca ggcaaaggcc 4380 
atccagtgta cgcccccagt ctcgcgaccg gccgtattga ctatggcgca ggcgagcttg 4440 
tgtggagaaa caaagcctgg aaagcgcttg tcataggtgc ccaaaaaata tggcccacaa 4500 
ccaagatctt tgacaatggc tttcagttcc tgctcactgg agcccatggc ggcagctgtt 4560 
gttgatgttg cttgcttctt tatgttgtgg cgttgccggc cgagaagggc gtgcgcaggt 4620 
acacggtttc gatgacgccg cggtgcggcc ggtgcacacg gaccacgtca aagacttcaa 4680 
acaaaacata aagaagggtg ggctcgtcca tgggatccat atatagggcc cgggttataa 4740 
ttacctcagg tcgacctcga gggatctttg tgaaggaacc ^ttacttctgt ggtgtgacat 4800 
aattggacaa actacctaca gagatttaaa gctctaaggt aaatataaaa tttttaagtg 48*60 
tataatgtgt taaactactg attctaattg tttgtgtatt ttagattcca acctatggaa 4920 
ctgatgaatg ggagcagtgg tggaatgcct ttaatgagga aaacctgttt tgctcagaag 4980 
aaatgccatc tagtgatgat gaggctactg ctgactctca acattctact cctccaaaaa 5040 
agaagagaaa ggtagaagac cccaaggact ttccttcaga attgctaagt tttttgagtc 5100 
atgctgtgtt tagtaataga actcttgctt gctttgctat ^tacaccaca aaggaaaaag 5160 
ctgcactgct atacaagaaa attatggaaa aatattctgt aacctttata agtaggcata 5220 
acagttataa tcataacata ctgttttttc -ttactccaca xzaggcataga gtgtctgeta 5280 
ttaataacta tgctcaaaaa ttgtgtacct ttagcttttt aatttgtaaa ggggttaata 5340 
aggaatattt gatgtatagt gccttgacta gagatcataa tcagccatac cacatttgta S400 
gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa acataaaatg S460 
aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggt^acaa ataaagcaat 5520 
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 5S80 
aaactcatca atgtatctta tcatgtctgg atccggctgt ggaatgtgtg tcagttaggg 5640 
tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag 5700 
tcagcaacca ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg 5760 
catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact 5820 
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 5880 
gccgaggccg cctcggcctc tgagctatt<: cagaagtagt gaggaggctt ttttggaggc 594 0 
ctaggctttt gcaaaaagct tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 6000 
ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg <:tgaccccgg atgaatgtca 6060 
gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 6120 
gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat €180 
tgccagctgg ggcgccctct ggtaaggttg ggaagccctg ccaaagtaaac -tggatggctt 6240 
tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 6300 
gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg €360 
agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 6420 
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaecgacctg txrcggtgecc "6480 
tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt -6540 
gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga -ctggctgcta ttgggcgaag 6600 
tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 6560 
ctgatgcaat gcggcggctg catacgcttg atccggctac <:t^cecattc gaccaccaag €720 
cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 6780 
atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc €840 
gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgoctgcttg ccgaatafcca €900 
tggtggaaaa tggccgcttt tctggattca tcgactgtgg ecggctgggt gtggcggacc €960 
gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 7020 
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc afccgectfcct 7080 
atcgccttct tgacgagttc ttctgagcgg gactctgggg ttcgaaatga -ccgaccaagc 7140 
gacgcccaac ctgccatcac gagatttcga ttccaccgcc gccttctatg aaaggttggg 7200 
cttcggaatc gttttccggg acgccggctg gatgatcctc -cagcgcgggg atctcatgct 7260 
ggagttcttc gcccaccccg ggctcgatcc cctcgcgagt tggtfccagct gctgcctgag 7320 
gctggacgac ctcgcggagt tctaccggca gtgcaaatcc gtcggcafccc aggaaaccag 7380 
cagcggctat ccgcgcatcc atgcccccga actgcaggag tggggaggca cgatggccgc 7440 
tttggtcccg gatctttgtg aaggaacctt acttctgtgg tgtgacataa ttggacaaac 7500 
tacctacaga gatttaaagc tctaaggtaa atataaaatt -tttaagtgta taafcgtgtta 7560 
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aactactgat tctaattgtt tgtgtatttt agattccaac ctatggaact gatgaatggg 7620 
agcagtggtg gaatgccttt aatgaggaaa acctgttttg ctcagaagaa atgccatcta 7680 
gtgatgatga ggctactgct gactctcaac attctactcc tccaaaaaag aagagaaagg 7740 
tagaagaccc caaggacttt ccttcagaat tgctaagttt tttgagtcat gctgtgttta 7800 
gtaatagaac tcttgcttgc tttgctattt acaccacaaa ggaaaaagct gcactgctat 7860 
acaagaaaat tatggaaaaa tattctgtaa cctttataag taggcataac agttataatc 7920 
ataacatact gttttttctt actccacaca ggcatagagt gtctgctatt aataactatg 7980 
ctcaaaaatt gtgtaccttt agctttttaa tttgtaaagg ggttaataag gaatatttga 8040 
tgtatagtgc cttgactaga gatcatiaatc agccatacca <^tt4:gtaga ggttttactt 8100 
gctttaaaaa acctcccaca cctccccctg aacctgaaac ataaaatgaa tgcaattgtt 8160 
gttgttaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 8220 
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa acfccatcaat 8280 
gtatcttatc atgtctggat ccccaggaag ctcctctgtg tcctcataaa -ccctaacctc 8340 
ctctacttga gaggacattc caatcatagg ctgcccatcc accctctgtg tcctcctgtt 8400 
aattaggtca cttaacaaaa aggaaattgg gtaggggttt ttcacagacc gctttctaag 8460 
ggtaatttta aaatatctgg gaagtccctt ccactgctgt gttccagaag tgttggtaaa 8S20 
cagcccacaa atgtcaacag cagaaacata caagctgtca gctttgcaca agggcccaac 8580 
accctgctca tcaagaagca ctgtggttgc tgtgttagta atgtgcaaaa caggaggcac 8640 
attttcccca. cctgtgtagg ttccaaaata tctagtgttt tcatttttac ttggatcagg_8700. 
aacccagcac, tccactggat aagcattatc cttatccaaa acagcqttgt ggtcagtgtt .8760 
catctgctga' ctgtcaactg tagcattttt tggggttaca gtttgagcag gatatfctggt 8820 
cctgtagttt gctaacacac cctgcagctc caaaggttcc ccaccaacag caaaaaaatg 8880 
aaaatttgac ccttgaatgg gttttccagc accattttca tgagtttttt gtgtccctga. 8940 
atgcaagttt aacatagcag ttaccccaat aacctcagtt ttaacagtaa cagcttccca 9000 
catcaaaata tttccacagg ttaagtcctc atttaaatta ggcaaaggaa tt-cttgaaga 9060 
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 9120 
tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa -cccctatttg <tttat£tttc 9180 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 9240 
tattgaaaaa ggaagagtat gagtattcaa cattfcccgtg tcgcccttat tccctttttt 9300 
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 9360 
gaagatcagt tgggtgcacg agtgggtcac atcgaactgg atctcaacag cggtaagatc 9420 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 9480 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 9540 
tattctcaga atgacttggt tgagtactca ccagfccacag aaaagcatct tacggatggc 9600 
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
ttacttctga caacgatcgg aggaocgaag gagctaaccg cttttttgca caacatgggg 9720 
gatcatgtaa ctcgccttga tcgttgggaa <:cggagctga atgaagccat accaaacgac 9780 
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 9840 
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 9900 
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 9960 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 10020 
cgtatcgtag ttatctacac gacggggagt caggcaacta ^ggatgaacg aaatagacag 10080 
atcgctgaga taggtgcctc actgattaag cattggtaac 4:gtcagacca agtttactca 10140 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 10200 
ctttttgata atctcatgac caaaatccct taacgtgagt ttt-cgttcca ctgagcgtca 10260 
gaccccgtag aaaagatcaa aggatcttct tgagafccctt tttttctgcg <:gtaatctgc 10350 
tgcttgcaaa caaaaaaacc accgctaeca gcggtggttt gtttgccgga *:<:aagagcta 10380 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 10440 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgec tacatacctc 1OS00 
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 10S60 
ttggactcaa gacgatagtt accggataag gcgcagcggt -cgggctgaac ggggggttcg 10620 
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 10680 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 10740 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 10800 
agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ct-cgfccaggg 10860 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc 10920 
tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga -taaccgtatt 10980 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 11040 
gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tcottacgca tctgtgcggt 11100 
atttcacacc gcatatggtg cactctcagt acaat<:tgct ctgatgccgc atagttaagc 11160 
cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag caaaatttaa 11220 
gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag ggttaggcgt 11280 
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tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 11400 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11460 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac -cgtgcatgcc 11520 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11S80 
catggattgg acgaaccact , 11600 

<210> 36 
<211> 53 
<212> DNA 

<213> Artificial Sequence 



<400> 36 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 53 



<210> 37 " 
<21I> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

cgagatcgat cacctccggt acaaggtttg gcatag 36 

<210> 38 
<211> 37 
<212> DNA 

<213> Artificial Sequence 



<210> 39 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 51 

<210> 40 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<220> 

<223> Description of Artificial Sequence: primer 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 38 

catgaagatc tggaaggtgc tgaggtacga tgagacc 



37 



<400> 40 

cacgaattcg tcagcgcttc tcgtcgcgtc caagaecc 



38 
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<210> 41 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

caccccgggg aggcggcggc gacggggacg gg 32 

<210> 42 
<211> 7231 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV80 
<400> 42 

ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca -60 
acaaggcaag gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg 120 
ctgcttcgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 180 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 240 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 300 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 360 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 420 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 480 
tgggactttc ctacttggca gtacatctac gtattagfcca tcgctattac catggtgatg 540 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 600 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttoca 660 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 720 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 780 
attaatacga ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta 840 
ccgagctcgg atccactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 900 
tactccctct gaaaagcggg catgacttct gcgctaagat tgtcagtttc -caaaaacgag 960 
gaggatttga tattcacctg gcccgcggtg atgcctttga gggtggecgc atccatctgg 1020 
tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa acgacccgta gagggcgttg 1080 
gacagcaact tggcgatgga gcgcagggtt tggtttttgt cgcgatcggc gcgctccttg 1140 
gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc gccattcggg aaagacggtg 1200 
gtgcgctcgt cgggcaccag gtgcacgcgc -caaccgcggt tgtgcagggt gacaaggtca 1260 
acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc agcagaggcg gccgcccttg 1320 
cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt ccggggggtc tgcgtccacg 1380 
gtaaagaccc cgggcagcag gcgcgcgtcg aagtagt<:ta tctfcgcatcc ttgcaagtct 1440 
agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt atgggttgag tgggggaccc 1500 
catggcatgg ggtgggtgag cgcggaggcg tacatgccgc aaatgtcgta aacgtagagg 1560 
ggctctctga gtattccaag atatgtaggg tagcatcttc caccgcggat gctggcgcgc 1620 
acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg gaccgaggtt gctacgggcg 1680 
ggctgctctg ctcggaagac tatctgcctg aagatggcat gtgagttgga tgafcatggtt 1740 
ggacgctgga agacgttgaa gctggcgtct gtgagaccta ccgcgtcacg cacgaaggag 1800 
gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga cctgcacgtc tagggcgcag 1860 
tagtccaggg tttccttgat gatgtcatac ttatcctgtc cctctttttt -ccacagctcg 1920 
cggttgagga caaactcttc gcggtctttc cagtactctt ggatcggaaa cccgtcggcc 1980 
tccgaacgag atccgtactc cgccgccgag ggacctgagc gagtccgcat cgaecggatc 2040 
ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg caagatccaa gatgaagcgc 2100 
gcccgcccca gcgaagatga cttcaacccc gtctacccct atggctacgc gcggaatcag 2160 
aatatcccct tcctcactcc cccctttgtc tcctccgatg gattcaaaaa -ctt^xrcccct 2220 
ggggtactgt cactcaaact ggctgatcca atcaccatta ccaatgggga -tgtatcectc 2280 
aaggtgggag gtggtctcac tttgcaagat ggaagcctaa ctgtaaaccc taaggetcca 2340 
ctgcaagtta atactgataa aaaacttgag cttgcatatg ataatecatt tgaaagtagt 2400 
gctaataaac ttagtttaaa agtaggacat ggattaaaag tattagatga aaaaagtgct 24^60 
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gcggggttaa aagatttaat tggcaaactt gtggttttaa caggaaaagg aataggcact 2520 

gaaaatttag aaaatacaga tggtagcagc agaggaattg gtataaatgt aagagcaaga 2580 

gaagggttga catttgacaa tgatggatac ttggtagcat ggaacccaaa gtatgacacg 2€40 

cgcacacttt ggacaacacc agacacatct ccaaactgca caattgct<:a agataaggac 2700 

tctaaactca ctttggtact tacaaagtgt ggaagtcaaa tattagctaa tgtgtctttg 27€0 

attgtggtcg caggaaagta ccacatcata aataataaga caaatccaaa aataaaaagt 2820 

tttactatta aactgctatt taataagaac ggagtgcttt tagacaactc aaatcttgga -2880 

aaagcttatt ggaactttag aagtggaaat tccaatgttt cgacagctta tgaaaaagca 2940 

attggtttta tgcctaattt ggtagcgtat ccaaaaccca gtaattctaa aaaatatgca 3000 

agagacatag tttatggaac tatatatctt ggtggaaaac ctgatcagcc agcagtcatt 3O€0 

aaaactacct ttaaccaaga aactggatgt gaatactcta tcacatttaa ctttagttgg 3120 

tccaaaacct atgaaaatgt tgaatttgaa accacctctt ttaccttctc ctatatstgcc 3180 

caagaatgaa agagcggccg ctcgagtcta gagggcccgt ttaaacccgc tgatcagcct 3240 

cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga 3300 

ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt 33^0 

gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg 3420 

attgggaaga caatagcagg catgctgggg atgcggtggg ctctatggct tctgaggcgg 3480 

aaagaaccag ctggggctct agggggtatc cccacgcgcc ctgtagcggc gcattaagcg 3540 

cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc <:tagcgcccg 3^00 

ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagotc 3660 

taaatcgggg catcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 3720 

aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtt€t-tegec 3780 

ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 3840 

tcaaccctat ctcggtctat tcttttgatt tataagggat tttggggatt tcggcctatt 3900 

ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttaattctgt ggaatgtgtg 3950 

tcagttaggg tgtggaaagt ccccaggctc cccaggcagg cagaagtatg caaagcatgc 4020 

atctcaatta gtcagcaacc aggtgtggaa agtccccagg -ctccccagca ggcagaagta 4080 

tgcaaagcat gcatctcaat tagtcagcaa ccatagtccc gcccctaact ccgcccatoc 4140 

cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta att-tttttta 4200 

tttatgcaga ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct 4260 

tttttggagg cctaggcttt tgcaaaaagc tcccgggagc ttgtatatce attttcggat 4320 

ctgatcagca cgtgttgaca attaatcatc ggcatagtat atcggcatag tataatacga 4380 

caaggtgagg aactaaacca tggccaagtt gaccagtgcc gttccggtgc tcaccgcgcg 4440 

cgacgtcgcc ggagcggtcg agttctggac cgaccggctc gggttctccc gggacttcgt 4500 

ggaggacgac ttcgccggtg tggtccggga cgacgtgacc ctgttcatca gcgcggtoca 4S60 

ggaccaggtg gtgccggaca acaccctggc ctgggtgtgg gtgcgcggcc tggacgagct 4620 

gtacgccgag tggtcggagg tcgtgtccac gaacttccgg gacgcctccg ggccggccat 4-680 

gaccgagatc ggcgagcagc cgtgggggcg ggagttcgcc ctgcgcgacc cggccggcaa 4740 

ctgcgtgcac ttcgtggccg aggagcagga ctgacacgtg ctacgagatt tcgattpcac 4800 

cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg gctggatgat 4860 

cctccagcgc ggggatctca tgctggagtt cttcgcccac cccaacttgt ttattgcagc 4920 

ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 4980 

actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctgtatacc 5040 

gtcgacctct agctagagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg 5100 

ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta aagectgggg S160 

tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcocg ctttccagtc 5220 

gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 5280 

gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgtfccggct 5340 

gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 5400 

taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 5460 

cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 5520 

ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt tfcccocctgg 5580 

aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc <tgtocgcctt 5640 

tctcccttcg ggaagcgtgg cgctttctca atgctcacgc tgtaggtatc tcagttcggt 5700 

gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 5760 

cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tafccgccact 5820 

ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 5880 

cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta «tctgcgcfcct 5940 

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 6000 

cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 6060 

tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg -6120 

ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagafccc -ttttaaatta -6180 



WO 01/83729 PO7EP01/04863 

47 




aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca '6240 
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 6300 
ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 6360 
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 6420 
agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat €480 
taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 6540 
tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt catfccagcfcc €600 
cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag €660 
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgfctat cactcatggt 67.20 
tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 6780 
tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 6840 
cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat €900 
tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc €960 
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 7020 
tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 7080 
atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 7140 
tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 7200 
cacatttccc cgaaaagtgc cacctgacgt c t . ' "~ 7231 

<210> 43 - - 

<211> 48 , 
<212> DNA : : 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 43 

tgtcttggat ccaagatgaa gcgcgcccgc cccagcgaag atgacttc 48 



<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 44 

aaacacggcg gccgctcttt cattcttg 28 

<210> 45 
<211> 17 
<212> PRT 

<213> Ad 37 N- terminus 
<400> 45 

Met Ser Lys Arg Leu Arb Val Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 IS 

<210> 46 
<211> 6 
<212> PRT 

<213> artificial sequence 
<400> 46 

Lys Arg Ala Arg Pro Ser 
1 5 



<210> 47 
<211> 17 



• 
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<212> PRT 

<213> Ad5 modified N- terminus 
<400> 47 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyx Pro Tyr 
15 10 15 
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<211> 24 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 48 

ggatccatgg gatacttggt agca 24 

<210> 49 
<211> 30 

<212>.DNA • . 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 



<400> 49 

gcaactcgag tcattcttgg gcaatatagg 

<210> 50 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence; primer 
<400> 50 

cgcgctgact cttaaggact agtttc 26 

<210> 51 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 51 

gcgcttaatt aacatcatca ataatatacc ttatttt 37 
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